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Chapter 1 


Vector Spaces 


1.1 Introduction 


1. (a) No. See 
(b) Yes. -3(-3,1,7) = (9,-3,-21) 
(c) N 
(d) N 


2. Here ¢ is in F. 
(a) (3,-2,4) + ¢(-8,9,-3) 
(b) (2,4,0) + t(—-5, -10, 0) 
(c) (3,7, 2) + t(0,0,-10) 
(d) (—2,-1,5) + €(5, 10, 2) 


3. Here s and ¢ are in F. 


(a) (2,-5,-1) + s(-2,9,7) + ¢(-5, 12,2) 
(b) (3,-6,7) + s(-5, 6,-11) + ¢(2, -3, -9) 
(c) (-8, 2,0) + (9, 1,0) + ¢(14, 3, 0) 

(d) (1,1,1) + s(4,4, 4) + t(-7, 3,1) 


4. Additive identity, 0, should be the zero vector, (0,0,...,0) in R”. 


5. Since a = (a1, a2) — (0,0) = (a1, a2), we have ta = (ta,,taz2). Hence the 
head of that vector will be (0,0) + (tai, taz) = (tar, tag). 


6. The vector that emanates from (a,b) and terminates at the midpoint 
should be $(c-a,d-b). So the coordinate of the midpoint will be (a,b) + 


$(c-a,d- b) = ((a+c)/2, (b+ d)/2). 


7. Let the four vertices of the parallelogram be A, B, C, D counterclockwise. 
Say « = AB and y = AD. Then the line joining points B and D should 
be « + s(y-—), where s is in F. And the line joining points A and C 
should be t(a + y), where t is in F. To find the intersection of the two 
lines we should solve s and t such that «+ s(y-ax) =t(x+y). Hence we 
have (1-s-t)x =(t-s)y. But since x and y can not be parallel, we have 
1-s-t=Oandt-s=0. Sos=t= $ and the midpoint would be the head 
of the vector 4(x +y) emanating from A and by the previous exercise we 
know it’s the midpoint of segment AC or segment BD. 


1.2 Vector Spaces 


1. (a) Yes. It’s condition (VS 3). 


(b) No. If x2, y are both zero vectors. Then by condition (VS 3) a = 
uty=y. 
. Let e be the zero vector. We have le = 2e. 


ZZ 
o 9° 


. It will be false when a = 0. 


Z 
a 


. It has m rows and n columns. 


No. For example, we have that x + (-2) = 0. 


5 . {ee} 
Ss a 

ZA 

oe) 


(k) Yes. That’s the definition. 
2. It’s the 3 x 4 matrix with all entries =0. 


3. Mi3 = 3, Moi =4, Moo =5. 


20 -12 
(©) ( 0 28 
30-20 
(d) | -15 10 } 
-5  -40 


(ce) 2x* +23 + Qn? - 22+ 10. 


(f) -a3 + 7x? +4, 


10. 


11. 
12. 


13. 


14. 


15. 


16. 
7. 


18. 


(g) 10x7 — 30a4 + 40x? - 152. 
(h) 32° - 6x? + 122 +6. 


8 3 1 9 1 4 17 4 5 
3.0 0 ]7+] 3 0 0 f=] 6 O O }. 
3.0 0 1 1 0 4 1 0 


3 1 2 6 
can describe the inventory in June. Next, the matrix 2M - A can describe 
the list of sold items. And the number of total sold items is the sum of all 
entries of 2M — A. It equals 24. 


42 1 38 
_Me= | 5 1 1 4 } Since all the entries has been doubled, we have 2M 


. It’s enough to check f(0) + g(0) = 2=h(0) and f(1) + 9(1) =6=AhA(1). 
. By (VS 7) and (VS 8), we have (a+ b)(x+y) = a(a@t+y)+0d(at+y) = 


ax + ay + ba + by. 


. For two zero vectors 09 and 0;, by Thm 1.1 we have that 09+ 2 =a =0,+2 


implies 09 = 01, where z is an arbitrary vector. If for vector x we have two 
inverse vectors yo and y;. Then we have that «+ yo = 0 = x+y implies 
Yo = yi. Finally we have 0a + la = (0+ 1)a = 1a=0+ 1a and so 0a = 0. 


We have sum of two differentiable real-valued functions or product of scalar 
and one differentiable real-valued function are again that kind of function. 
And the function f = 0 would be the 0 in a vector space. Of course, here 
the field should be the real numbers. 


All condition is easy to check because there is only one element. 


We have f(-t)+9(-t) = f(t) + g(t) and cf(-t) = cf(t) if f and g are both 
even function. Futhermore, f = 0 is the zero vector. And the field here 
should be the real numbers. 


No. If it’s a vector space, we have 0(a1, a2) = (0,a2) be the zero vector. 
But since a2 is arbitrary, this is a contradiction to the uniqueness of zero 
vector. 


Yes. All the condition are preserved when the field is the real numbers. 


No. Because a real-valued vector scalar multiply with a complex number 
will not always be a real-valued vector. 


Yes. All the condition are preserved when the field is the rational numbers. 


No. Since 0(a@1, a2) = (a1, 0) is the zero vector but this will make the zero 
vector not be unique, it cannot be a vector space. 


No. We have ((a1, a2) iF (b1, b2)) + (c1,€2) = (a1 + 2b, + 2c, a2 + 3b + 3c2) 
but (a1, a2) + ((b1, be) + (c1,€2)) = (a1 + 2b; + 4c1, a2 + 3b + 9c2). 


19. 


20. 


21. 


22. 


No. Because (c+ d)(a1, a2) = ((c+d)a1, “4)) may not equal to c(a1, a2) + 
d(a1, a2) = (ca, +de,, 2+). 


A sequence can just be seen as a vector with countable-infinite dimensions. 
Or we can just check all the condition carefully. 


Let Oy and Ow be the zero vector in V and W respectly. Then we have 
(Ov, 0w) will be the zero vector in Z. The other condition could also be 
checked carefully. This space is called the direct product of V and W. 


Since each entries could be 1 or 0 and there are m x n entries, there are 
2”*” vectors in that space. 


1.3 Subspaces 


1. 


(a) No. This should make sure that the field and the operations of V and 
W are the same. Otherwise for example, V = R and W = Q respectly. 
Then W is a vector space over Q but not a space over R and so not 
a subspace of V. 


(b) No. We should have that any subspace contains 0. 
(c) Yes. We can choose W = 0. 


(d) No. Let V=R, Eo = {0} and Ey = {1}. Then we have Eon Ey = @ is 
not a subspace. 


(e) Yes. Only entries on diagonal could be nonzero. 
(f) No. It’s the summation of that. 


(g) No. But it’s called isomorphism. That is, they are the same in view 
of structure. 


(a) ( es 3 ) wien tr=—5. 
0 3 
0) | $4 } 
-6 7 
-3 0 6 
(2, ( G:. 22s 
10 2 -5 
o ( 0 -4 7 |v tr= 12. 
-8§ 3 6 
1 
(e) | all 
5 


10. 


11. 


12; 


13. 


-2 7 
5 0 
(Q) a 4 
4 -6 
(g) (5 6 7). 
-4 0 6 
(h) 0 1 -3 | with tr=2. 
6 -3 5 


. Let M=aA+bB and N = aA'+bB". Then we have Mi; = aAyj+bBi; = Nji 


and so M'=N. 


. We have Aj, = Aj; and so Af, = Ajj. 


. By the previous exercises we have (A + A‘)! = A‘ +(A‘)! = A‘ +A and so 


it’s symmetric. 


. We have that tr(aA So bB) = eG aA; + bBy; = ade, Ai a by Bi = 


atr(A) + btr(B). 


. If A is a diagonal matrix, we have A;; = 0 = Aj; when i # j. 


. Just check whether it’s closed under addition and scalar multiplication 


and whether it contains 0. And here s and ¢ are in R. 


(a) Yes. It’s a line ¢(3,1,-1). 
(b) No. It contains no (0,0,0). 


) 
) 
c) 
) 
) 
) 


( 

(d 
( 
( 


Yes. It’s a plane with normal vector (2,-7,1). 
Yes. It’s a plane with normal vector (1,-4,-1). 
e) No. It contains no (0, 0,0). 


f) No. We have both (3, /5,0) and (0, /6, V3) art elements of We 
but their sum (V3, V5 + V6, V3) is not an element of We. 


. We have Wi nW3 = {0}, Wi nW, = Wi, and W3nNW, isa line t(11,3,-1). 


We have W, is a subspace since it’s a plane with normal vector (1,1,...,1). 
But this should be checked carefully. And since 0 ¢ W2, W2 is not a 
subspace. 


No in general but Yes when n= 1. Since W is not closed under addition. 
For example, when n = 2, (2? + x) + (-a?) = @ is not in W. 


Directly check that sum of two upper triangular matrix and product of one 
scalar and one upper triangular matrix are again uppe triangular matrices. 
And of course zero matrix is upper triangular. 


It’s closed under addition since (f + g)(so) = 0+0=0. It’s closed under 
scalar multiplication since cf(s 9) = c0 = 0. And zero function is in the set. 
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14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


It’s closed under addition since the number of nonzero points of f+g is less 
than the number of union of nonzero points of f and g. It’s closed under 
scalar multiplication since the number of nonzero points of cf equals to 
the number of f. And zero function is in the set. 


Yes. Since sum of two differentiable functions and product of one scalar 
and one differentiable function are again differentiable. The zero function 
is differentiable. 


If f™ and g™ are the nth derivative of f and g. Then f(™ +g will be 
the nth derivative of f +g. And it will continuous if both f( and g™ 
are continuous. Similarly cf‘ is the nth derivative of cf and it will be 
continuous. This space has zero function as the zero vector. 


There are only one condition different from that in Theorem 1.3. If W is 
a subspace, then 0 « W implies W + @. If W is a subset satisfying the 
conditions of this question, then we can pick x € W since it’t not empty 
and the other condition assure 0x = 0 will be a element of W. 


We may compare the conditions here with the conditions in Theorem 1.3. 
First let W be a subspace. We have cx will be contained in W and so is 
ca +y if x and y are elements of W. Second let W is a subset satisfying 
the conditions of this question. Then by picking a= 1 or y = 0 we get the 
conditions in Theorem 1.3. 


It’s easy to say that is sufficient since if we have W, c W2 or W2 c W, then 
the union of W; and W2 will be W, or W2, a space of course. To say it’s 
necessary we may assume that neither W, c W2 nor W2 c W, holds and 
then we can find some x € W,\W, and y € W2\W,. Thus by the condition 
of subspace we have x + y is a vector in W, or in Wo, say W,. But this 
will make y = (x+y) — 2 should be in Wj. It will be contradictory to the 
original hypothesis that y ¢ W2\W,. 


We have that a;w; € W for all i. And we can get the conclusion that a,wy, 
AW 1 + GQW2, 41W) + agw2 + a3w3 are in W inductively. 


In calculus course it will be proven that {a,+6,} and {ca,,} will converge. 
And zero sequence, that is sequence with all entris zero, will be the zero 
vector. 


The fact that it’s closed has been proved in the previous exercise. And a 
zero function is either a even function or odd function. 


(a) We have (a1 + 2) + (yi + y2) = (41 + y1) + (42 + yo) € Wi + We and 
C(x, a x2) = CX, + CM € Wi + We if T15Y1 € Wi and 2,42 € Wo. And 
we have 0=0+0€W,+ Wa. Finally W, = {1+ 0:c€W1,0€ Wo} c 
W, + W, and it’s similar for the case of Wo. 

(b) If U is a subspace contains both W, and W2 then x+y should be a 
vector in U for all « « W, and ye Wo. 
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24. 


25. 
26. 
27. 
28. 


29. 


30. 


3l. 


It’s natural that Wn W2 = {0}. And we have F” = {(a1,do,...,@n) 1a; € 
F} = {(a1, @2,..-,@n-1,0) + (0,0,...,an): a; ¢€ F} = Wi, © Wo. 


This is similar to the exercise 1.3.24. 
This is similar to the exercise 1.3.24. 
This is similar to the exercise 1.3.24. 


By the previous exercise we have (M,+Mp2)' = Mi + M5 =-(M,+ Mp2) and 
(cM)! = cM‘ = -cM. With addition that zero matrix is skew-symmetric 
we have the set of all skew-symmetric matrices is a space. We have 
Mnxn(F) = {A: A € Maxn(F)} = {(A+ A2) + (A- At): A € Maun (F)} = 
W,+W2. and Win W2 = {0}. The final equality is because 4+ A‘ is sym- 
metric and A - A’ is skew-symmetric. If F is of characteristic 2, we have 
W, = Ws. 


It’s easy that WinW2 = {0}. And we have Mnyn(F) = {A: A € Mnxn(F)} = 
{(A- B(A))+ B(A): A € Mnxn(F)} = Wi+W2, where B(A) is the matrix 


If V = W, @ W2 and some vector y € V can be represented as y = £1 + XZ = 
x, +25, where x1,27 € W; and x,24 € We, then we have x, - x, « Wi 
and #1 - 2 = @2+25 € Wo. But since Win Wo = {0}, we have a1 = 2} 
and x2 = «5. Conversely, if each vector in V can be uniquely written as 
t+ 22, then V = W,+ We. Now if c ¢ Win We and x # 0, then we have 
that x =2+0 with ce W, and 0€ W2 or x= 04+ 2 with 0€ W, and re Wo, 
a contradiction. 


(a) If v+W is a space, we have 0 = v+(-v) €v+W and thus -v € W and 
véW. Conversely, if ve W we have actually v + W = W, a space. 


(b) We can proof that v1 + W = v2 + W if and only if (v1 - v2) +W =W. 
This is because (—v,) + (v1 + W) = {-v+tv+w:weW} = W and 
(-v1) + (vo + W) = {-v, tue t+w:we Wh = (-v, + v2) +W. So if 
(v1 —v2)+W =W, aspace, then we have v1 - v2 € W by the previous 
exercise. And if vj — vo € W we can conclude that (v1 -v2) +W =W. 

(c) We have (v1 + W) + (vo + W) = (v1 + ve) +W = (v1, +05) + W = 
(vu, +W)+(v5+W) since by the previous exercise we have v; — vi, « W 
and v2 - v5 ¢ W and thus (v1 + v2) - (vt + v4) € W. On the other 
hand, since v, — v; € W implies av, - av} = a(v1 - vi) € W, we have 
a(vr + W) = a(v, + W). 

(d) It closed because V is closed. The commutativeity and associativity 
of addition is also because V is commutative and associative. For 
the zero element we have (x+W)+W =2+W. For the inverse 
element we have (x+W)+(-2+W) = W. For the identity element of 
multiplication we have 1(a+W)=2+W. The distribution law and 
combination law are also followed by the original propositions in V. 
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But there are one more thing should be checked, that is whether it 
is well-defined. But this is the exercise 1.3.31.(c). 


1.4 Linear Combinations and Systems of Linear 
Equations 


1. (a) Yes. Just pick any coeficient to be zero. 
(b) No. By definition it should be {0}. 
(c) Yes. Every subspaces of which S is a subset contains span(S) and 
span(S) is a subspace. 
(d) No. This action will change the solution of one system of linear 
equations. 
(e) Yes. 
(f) No. For example, 0x = 3 has no solution. 
Gy - @ - 243 - «wy = -3 
2. (a) Original system <= v3 + 2x4 4 .Sowe 
4x3 + 8x4 = 16 
have solution is {(5 +s - 3t,s,4—-2t,t):s,t¢F}. 
{(-2,-4,-3)}. 
No solution. 
{(-16 - 85,9 + 3s,5,2):s¢F}. 
{(-4+ 10s - 3t,3 - 35 + 2t,r,s,5):s,t¢F}. 
{(3, 4,-2)}. 
Yes. Solve the equation x1(1,3,0) + v2(2,4,-1) = (-2,0,3) and we 
have the solution (21, 2%2) = (4,-3). 
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5. 


10. 


11. 


12. 


13. 


(a) Yes. 


(b) No. 
c) No. 


( 

(d 
( 
( 


Yes. 
Yes. 
No. 
Yes. 
No. 


e€ 
f 


& 


) 
) 
) 
) 
) 
) 
(g) 
(h) 


For every (21, 22,23) € F? we may assume 
mi, 1,0) + yo(1, 0, 1) + y3(0, 1, 1) = (a1, 22,23) 


and solve the system of linear equation. We got (#1, 72,23) = $ (a1 —2Q+ 
x3)(1, 1,0) + $(x4 as He aoe x3)(1,0, 1) te $(-2 Plot x3)(0, 1, 1). 


For every (21, 22,...%y) € F” we can write (11,%2,...%n) = ©1€, + Leg + 
“+ In €n. 


It’s similar to exercise 1.4.7. 
It’s similar to exercise 1.4.7. 


For x # 0 the statement is the definition of linear combination and the set 
is a line. For x = 0 the both side of the equation is the set of zero vector 
and the set is the origin. 


To prove it’s sufficient we can use Theorem 1.5 and then we know W = 
span(W) is a subspace. To prove it’s necessary we can also use Theorem 
1.5. Since W is a subspace contains W, we have span(W) c W. On the 
other hand, it’s natural that span(W) 2 W. 


To prove span(S;) ¢ span(.S2) we may let v € S;. Then we can write 
UV = A121 + GQ%2Q + +++ +4323 where x; is an element of S; and so is S» for 
all n =1,2,...,n. But this means v is a linear combination of Sz: and we 
complete the proof. If span(S,) = V, we know span(.S2) is a subspace 
containing span(S;). So it must be V. 


We prove span(S} U.S2) ¢ span(S;) +span(S2) first. For v € span(S; U $2) 
we have v = Dji1 iti t Di) bjy; with ax; ¢ $1 and y; € Sy. Since the first 
summation is in span(.S;) and the second summation is in span(S2), we 
have v € span(S;) + span(.S2). For the converse, let w+ vu € span(S;) + 
span(.S2) with u € span(S,) and v € span(S2). We can right u+v 
Dini iti + Li byyj; with x; € $1; and yj € Sy and this means u+v € 
span(S1 U So). 
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14. For v€ span($; $2) we may write v = Dj) a;v; with x; € S$; and 2; € So. 
So v is an element of both span(S;) and span(.S2) and hence an element 
of span(.S) Nspan(S2). For example we have if S; = Sz = (1,0) then they 
are the same and if S; = (1,0) and Sp = (0,1) then we have the left hand 
side is the set of zero vector and the right hand side is the the plane R?. 


15. If we have both a,v1 + aqvq +++ + AnUp, = 0101 + dove +++-by Uy, then we have 
(a1 — b1)v1 + (a2 -— b2)vq +++ + (Gn - bn) Un = 0. By the property we can 
deduce that a; = 6; for all 7. 


16. When W has finite element the statement holds. Otherwise W - {v}, 
where vu ¢ W will be a generating set of W. But there are infinitely many 
veWw. 


1.5 Linear Dependence and Linear Independence 


1. (a) No. For example, take S = {(1,0), (2,0), (0,1)} and then (0,1) is not 
a linear combination of the other two. 
(b) Yes. It’s because 10 = 0. 
(c) No. It’s independent by the remark after Definition of linearly inde- 
pendent. 
(d) No. For example, we have S = {(1,0), (2,0), (0, 1)} but {(1, 0), (0,1)} 
is linearly independent. 


(e) Yes. This is the contrapositive statement of Theorem 1.6. 


(f) Yes. This is the definition. 
F 1 -3 -2 6 
2. (a) Linearly dependent. We have -2 ae i a eee 


check the linearly dependency is to find the nontrivial solution of 
equation a,271 + Q2%q +++ + Gn2%py = 0. And 2, and 2x2 are the two 
matrices here. 


. So to 


(b) Linearly independent. 
(c) Linearly independent. 
(d) Linearly dependent. 
(e 
( 


f 
(g) Linearly dependent. 


Linearly dependent. 


(h) Linearly independent. 
(i) Linearly independent. 
(j 
3. Let M1, Mo,..., Ms be those matrices. We have M,+M2+M3-M,-Ms = 0. 


) 
) 
) 
) 
) Linearly independent. 
) 
) 
) 
) 


Linearly dependent. 
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10. 


11. 


12. 


13. 


14. 


If aye, + deg + +++ + Ann = (41, d2,...,A4n) = 0, then by comparing the i-th 
entry of the vector of both side we have a, = a2 =-+: = Gn = 0. 


It’s similar to exercise 1.5.4. 
it’s similar to exercise 1.5.4. 


Let £;; be the matrix with the only nonzero 7j-entry= 1. Then {.£1,, E22} 
is the generating set. 


(a) The equation x; (1, 1,0)+a2(1,0,1)+23(0, 1,1) = 0 has only nontrivial 
solution when F = R. 


(b) When F has characteristic 2, we have 1+1 = 0 and so (1,1,0) + 
(1,0, 1) + (0, 1,1) = (0,0,0). 


It’s sufficient since if u = tv for some t € F then we have u—- tv = 0. While 
it’s also necessary since if aw + bu = 0 for some a,b ¢ F with at least one of 
the two coefficients not zero then we may assume a # 0 and u= -2y, 


Pick v; = (1, 1,0), ve = (1,0, 0), v3 = (0,1,0). And we have that none of the 
three is a multiple of another and they are dependent since v) — v2 —v3 = 0. 


Vector in span(.S) are linear combinations of S and they all have different 
representation by the remark after Definition of linear independent. So 
there are 2” representations and so 2” vectors. 


Since S; is linearly dependent we have finite vectors 71,%2,...,p in S} 
and so in S3 such that a1271+a2%2+-+'+Gn2p = 0 is a nontrivial representa- 
tion. But the nontrivial representation is also a nontrivial representation 
of Sg. And the Corollary is just the contrapositive statement of the The- 
orem 1.6. 


(a) Sufficiency: If {u+v,u-v} is linearly independent we have a(u+v) + 
b(u-v) = 0 implies a = b = 0. Assuming that cu + dv = 0, we can 
deduce that S48 (u+ v)+ =4(u- v) = 0 and hence ad = a4 =0. This 
means c= d= 0 if the characteristc is not two. Necessity: If {u,v} is 
linearly independent we have au + bv = 0 implies a = b=0. Assuming 
that c(u+v) +d(u-v) =0, we can deduce that (c+ d)u+(c-d)v =0 
and hence c+d = c—d=0 and 2c=2d=0. This means c= d = 0 if the 
characteristc is not two. 

(b) Sufficiency: If au + bv + cw =0 we have “2-£(u+v) + 2*(ut w) + 
=atPte(y + w) = 0 and hence a = b= c= 0. Necessity: If a(u+v) + 
b(ut+ w)+c(ut+w) =0 we have (a+ b)ut+(a+c)uv+(b+c)w =0 and 
hence a=b=c=0. 


Sufficiency: It’s natural that 0 is linearly dependent. If v is a linear 
combination of ui, U2,...,Un , SAY V = A, U1, + G2QU2 +°-AnUn, then v- au, - 
agUug — +++ — GnUn = 0 implies S$ is linearly dependent. Necessity: If S$ is 
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15. 


16. 


17. 


18. 
19. 


20. 


linearly dependent and S # {0} we have some nontrivial representation 
AoUp + a1U, +°++++ GnUn = 0 with at least one of the coefficients is zero, say 
ago = 0 without loss the generality. Then we can let v = ug = 2 (aru + 
AgUg +++ + AnUn). 


Sufficiency: If wu; =0 then S is linearly independent. If 


Uns € Span({u1, U2,...,UK}) 


for some k, say Ugy1 = U1 +G2Qugt+-:++G,UR, then we have ayu,+agug+---+ 
ApUk — Uk+1 = 0 is a nontrivial representation. Necessary: If S is linearly 
dependent, there are some integer & such that there is some nontrivial 
representation aju, + QgUq +++ + GpUy + Api1Upi1 = 0. Furthermore we 
may assume that az41 # 0 otherwise we may choose less k until that 
Gp+1 #= 0. Hence we have aps, = -—(ayu, + agu2 +++ agug) and so 


Qk+1 
Ag+1 € Span({ui,U2,--.,UK})- 


Sufficiency: We can prove it by contrapositive statement. If S' is linearly 
dependent we can find ayuy, + agug +++ +GnUn = 0. But thus the finite set 
{u1,U2,-..,Un} would be a finite subset of S and it’s linearly dependent. 
Necessary: This is the Threorem 1.6. 


Let Cy,Co,...,C, be the columns of M. Let a,C) + agCo +--+ an,Cy, = 0 
then we have a,, = 0 by comparing the n-th entry. And inductively we 
have ay_1 = 0,dn_2 = 0,...,a, = 0. 


It’s similar to exercise 1.5.17. 


We have a, Aj + a2A§ +--+ +a,At =0 implies a,A; + a2Aq +--+ apAp = 0. 
Then we have a, = dg =-:: = Qn, = 0. 


If {f,g} is linearly dependent, then we have f = kg. But this means 
1 = f(0) = kg(0) = kx 1 and hence k = 1. And e” = f(1) = kg(1) = e° 


means r = S. 


1.6 Bases and Dimension 


1. 


(a) No. The empty set is its basis. 
(b) Yes. This is the result of Replacement Theorem. 

No. For example, the set of all polynomials has no finite basis. 
No. R? has {(1,0), (1,1)} and {(1,0), (0, 1)} as bases. 

Yes. This is the Corollary after Replacement Theorem. 


No. It’s n+ 1. 


(c 
(d 
( 
( 


O 


f 


g) No. It’s mx n. 


) 
) 
) 
) 
) 
) 
(g) 
(h) 


Yes. This is the Replaceent Theorem. 
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(i) No. For $= 1,2, a subset of R, then 5=1x1+2x2=3x1+1x2. 
(j) Yes. This is Theorem 1.11. 

(k) Yes. It’s {O} and V respectly. 

(1) Yes. This is the Corollary 2 after Replacement Theorem. 


2. It’s enough to check there are 3 vectors and the set is linear independent. 


4. It’s impossible since the dimension of P3(R) is four. 


OU 


. It’s also impossible since the dimension of R? is three. 


6. Let Ej; be the matrix with the only nonzero 7j-entry= 1. Then the sets 
{Fi1, Fiz, E21, E22}, {Fit Fie, Fiz, Foi, E22}, and {£11+£a1, E12, E21, E22} 
are bases of the space. 


7. We have first {u1,u2} is linearly independent. And since u3 = —4u; and 
ua = —3u, + 7u2, we can check that {u1, u2, us} is linearly independent and 
hence it’s a basis. 


8. To solve this kind of questions, we can write the vectors into a matrix as 
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below and do the Gaussian elimintaion. 
2 -3 4 -5 2 
-6 9 -12 15 -6 
3-2 7 -9 1 


O <3 = @ i 
it @ — 2.2 
y =) 4 =o -% 
? -3 &@ -=11 ¢ 
00 0 0 0 
oO =<f 19: <8 % 
0-8 6 -8 10 
“lo 1 0 4 = «-5 
O t 6. =} <a 
to 2 2 = 
0-1 5 -15 11 
0 0 26 -20 -6 
00 0 0 0 
00 1 4 -5 
OO: ba 232 =99 
“10 0 =—@. =F <1 
1 6 -=-9 4 
io 2 3 = 
00 11 -18 7 
00 0 -124 124 
00 0 0 0 
00 1 #4 ~— -5 
0 0 0 248 248 
“1G 0 O 31 =s1 
01 6 =<} A 
Lic 2 3 2 
00 0 -62 62 
00 0 -124 124 
00 0 0 0 
001 4 -5 
00 0 0 0 
“lo 0 0 0 0 
016 -3 4 
to <2 & 2 
00 0 0 0 


And the row with all entries 0 can be omitted!] So {u1, U3, Ue, U7} would 


1 Which row with all entries is important here. So actually the operation here is not the 
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be the basis for W (the answer here will not be unique). 


9. If a U1 + AQU2 + AZU3 + A4U4 = (a1, a, + 42,41 +42 + 23,a, + A2+ a3 + a4) =0 
we have a; = 0 by comparing the first entry and then ag = a3 = a4 = 0. For 
the second question we can solve (a1, a2, 43,44) = @1U1 + (a2 — a1) ug + (a3 - 
ag)u3 + (a4 — a3) U4. 

10. The polynomials found by Lagrange interpolation formula would be the 
answer. It would have the smallest degree since the set of those polyno- 
mials of Lagrange interpolation formula is a basis. 


(a) —4a7-24+8. 
(b) -3a + 12. 
(c) -a2? +22? + 4a -5. 
(d) 2x3 — x? - 6x +15. 
11. If {u,v} is a basis then the dimension of V would be two. So it’s enough 
to check both {u+v,au} and {au, bv} are linearly independent. Assuming 


s(u+v) +tau = (s+ta)u+sv =0 we have s+ta=s=0 and hence s =t = 0. 
Assuming sau + tbv = 0 we have sa = tb=0 and hence s =t =0. 


12. If {u,v,w} is a basis then the dimension of V would be three. So it’s 
enough to check {u+u+w,u+w,w} is lineaerly independent. Assuming 
a(ut+utw)t+b(v+w)+cw = aut (at b)vt+(at+b+c)w = 0 we have 
a=a+b=a+b+c=0 and hence a=b=c=0. 


13. We can substract the second equation by the two times of the first equa- 
tion. And then we have 


21 - 242+ 43 =0 
r2-2%3=0 


Let 23 = s and hence x2 = s and 2, = s. We have the solution would be 
{(s, s,s) = s(1,1,1):s¢R}. And the basis would be {(1,1,1)}. 


14. For W, we can observe that by setting ag = p, a3 = q, a4 = 8, and as =t we 
can solve a, = q+s. So W, = {(q+s,p,q, 8,t) = p(0,1,0,0,0)+q(1, 0,1, 0,0)+ 
s(1,0,0, 1,0) +¢(0,0,0,0,1):p,q,s,t¢F°}. And 


{(0, 1,0,0, 0), (1,0, 1,0, 0), (1,0, 0, 1,0), (0, 0, 0, 0, 1)} 


is the basis. The dimension is four. And similarly for W2 we may set 
a4 = 8, a5 = t. And then we have a, t, dg = a3 =a4=5 and 


W, = {(-t, s,s, s,t) = s(0,1,1,1,0) +t(-1,0,0,0,1):s,t¢F°} 
. And hence 


{(0,1,1,1,0),(-1,0,0,0,1)} 


is the basis of Wz. The dimension is two. 


standard Gaussian elimination since we can not change the order of two row here. 
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15. 


16. 


17. 


18. 


19. 


20. 


yal 


22. 


Just solve Au +Aoo ttAnn = 0 and hence {Eij }igj U{ Eui— Enn }i-t,2,...n-1} 
is the basis, where {£;;} is the standard basis. And the dimension would 
be n? - 1. 


We have A; = 0 for all i> j. Hence the basis could be {F£;;}i<;} and the 
n(nt1) 
ae 


dimension is 


We have A;; = 0 and A;; = Aj;. Hence the basis could be {Ej — Ej; }i<; 
and the dimension is ated) 

Let e; be the sequence with the only nonzero i-th term= 1. Then we 
have {e;}i20 is a basis. To prove it, we have that every sequence is linear 
combination of the basis since we only discuss the sequence with finite 
nonzero entries. Furthermore we have for every finite subset of {e;}is0 is 
linearly independent. 


If every vector has a unique representation as linear combination of the set 
8, this means every vector is a linear combination of 3. Furthermore, if 
there are nontrivial representation of 0, then we have there are two repre- 
sentations, say the trivial and nontrivial one, of 0. This is a contradiction. 


(a) If S = @ or S = {0}, then we have V = {0} and the empty set can 
generate V. Otherwise we can choose a nonzero vector u; in S, and 
continuing pick uz+1 such that ugy1 ¢ span({u1,u2,...,ug}). The 
process would teminate before k > n otherwise we can find linearly 
independent set with size more than n. If it terminates at k = n, 
then we knoew the set is the desired basis. If it terminates at k <n, 
then this means we cannot find any vector to be the vector uz4;. So 
any vectors in S$ is a linear combination of 6 = {u1,u2,...,uz} and 
hence 6 can generate V since S can. But by Replacement Theorem 
we have n<k. This is impossible. 


(b) If S has less than n vectors, the process must terminate at k <n. It’s 
impossible. 


Sufficiency: If the vector space V is finite-dimensional, say dim= n, and 
it contains an infinite linearly independent subset (, then we can pick an 
independent subset 8’ of 6 such that the size of 6’ is n+ 1. Pick a basis 
a with size n. Since a is a basis, it can generate V. By Replacement 
Theorem we have n >n+1. It’s a contradiction. Necessity: To find the in- 
finite linearly independent subset, we can let S be the infinite-dimensional 
vector space and do the process in exercise 1.6.20(a). It cannot terminate 
at any k otherwise we find a linearly independent set generating the space 
and hence we find a finite basis. 


The condition would be that Wy c W2. Let a and £ be the basis of W1nW 
and W,. Since W; and W3 finite-dimensional, we have a and £ are bases 
with finite size. First if W , is not a subset of W2, we have some vector 
vé€W,\W2. But this means that v ¢ span() and hence SU {v} would be a 
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23. 


24. 


25. 


26. 
27. 


28. 


29. 


independent set with size greater than that of @. So we can conclude that 
dim(W, n W2) =dim(W,). For the converse, if we have W; c W2, then we 
have W,n W2 = W, and hence they have the same dimension. 


Let a and £ be the basis of W, and W32. By the definition we have both 
qa and § are bases with finite size. 


(a) The condition is that v ¢ W). If v¢ W, = span(a), thus aU{v} would 
be a independent set with size greater than a. By Replacement The- 
orem we have dim(W,) <dim(W2). For the converse, if v ¢« W, = 
span({v1, v2,...,ve}), we actually have W2 = span({v1, v2,-.-.,Uk,Uv}) = 
span({v1, v2,..., Uz }) = Wi and hence they have the same dimension. 


(b) Since we have W, c Wo, we have in general we have dim(W,) <dim(W2). 


By exercise 1.5.18 we have 6 = {fO}in01 ae n is independent since they 
all have different degree. And since dim(P,,(R)) = +1 we can conclude 
that 6 is a basis and hence it generate the space P,,(R)). 


It would be m+n since (a,0) U (0,8) would be a basis of Z if a and 3 
are the basis of V and W respectly, where (a,0) = {(u,0)¢ Z: we V} and 
(0,8) ={(0,u)€Z: ue W}. 

It would be n since {x -a,a? -a?,...,2” —a™} is a basis. 

The dimension of WinP,,(F) and W2nP,,(F) are | 24+] and [| “4*] respectly 
since {x*} with 0 <i <n is an odd number and {z/} withO<j<nisa 
even number are bases of the two spaces respectly. 


If a is the basis of V over R, then we have a Uta is the basis of V over R, 
where ia = {ive V:vea}. 


(a) Using the notation of the Hint, if we assume 


k m n 

* Xx x 
Y atts + >) bv: + Yaw =0 
i=1 i=1 t=1 


, then we have 


m k n 
V= Y biv; = - Vayu; - >) cw, 
i=l i=1 


2 
I 
B 


is contained in both W; and W2 and hence in Win W.. But if 
v + 0 and can be express as u = Y*, a/u;, then we have 1%, b:v; — 
ye, au; = 0. This is contradictory to that {u1,...,01,...} is a basis 
of W,. Hence we have 


m k n 
V= bi; = - Y aju; - Yaw =0 
i=1 i=1 i=1 
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30. 


3l. 


32. 


33. 


, this means a; = 6; = q = O for all index 7, 7, and k. So the set 
B= {u1,...,U1,--.,W1,-.-} is linearly independent. Furthermore, for 
every 7+ y € Wy + We with x ¢ W, and y € W2 we can find the 
representation x = 7%, dju; + ™, bj; and y = VE, dius + DL Guy. 
Hence we have 


k Mm 7 
ee ea YY (di + Gust D5 divi + DS caw; = 0 
1 i=1 i=l 


is linear combination of 6. Finally we have dim(W; + W2) =k+m+ 
n =dim(W,)+dim(W2)-dim(W, 9 W2) and hence W, + W% is finite- 


dimensional. 


(b) With the formula in the previous exercise we have 


dim(W; + Wo) = dim(W}) + dim(W2) = dim(W, nN W2) 
= dim(W,) + dim(W2) 


if and only if dim(W, nN W2) = 0. And dim(W, n W2) = 0 if and only 
if W, nN W2 = {0}. And this is the sufficient and necessary condition 
for V = Wi ® Wo. 


It can be check W, and W2 are subspaces with dimension 3 and 2. We 


: ‘i eV: a,b € F} and it 
has dimension 1. By the formula of the previous exercise, we have that 


dimension of W, + W2 is 2+3-1=4. 


also can find out that Win W2 = { 


(a) This is the conclusion of W,n W2 c Wo. 
(b) By the formula in 1.6.29(a) we have the left hand side 


=m+n-dim(WinW2)<min 
since dim(W,n W2) > 0. 
(a) Let W, be the xy-plane with m= 2 and W2 be the z-axis with n = 1. 


Then we have W, n W2 = W2 has dimension 1. 


(b) Let W, be the xy-plane with m = 2 and W»2 be the z-axis with n = 1. 
Then we have W + W2 = R? has dimension 3 = 2+ 1. 

(c) Let W, be the ry-plane with m = 2 and W2 be the xz-axis with n = 2. 
Then we have W, nN W2 is the x-axis with dimension 1 and W,; + W2 
is R? with dimension 3 # 2 +2. 


(a) Since V = W, @ Wz means W, 1 We = {0} and a basis is linearly 
independent and so contains no 0, we have 6, 2 = @. And it a 
special case of exercise 1.6.29(a) that (1 U Bo is a basis. 


23 


(b) Let u; and v; are vectors in 6; and {2 respectly. If there is a nonzero 
vector u € W,n Wo, we can write u = Dit) au; = Dad bjv;. But it 
impossible since it will cause 


n m 
>) au; = >. bjv; =0 
i=1 j=l 


On the other hand, for any v € V, we can write v = YL, cui + 
ee djv; € Wi + We. For any x+y € Wi + We with x ¢ W; and 
y € Wo, we have x = Dj, eu; and y = Lj, fjvj;. Thus we have 
L+ty= Dey ei; + viet fjvj eV. 
34. (a) Let 6 be the basis of V and a be the basis of W;. By Replacement 


Theorem we can extend a to be a basis a’ of V such that aca’. By 
the previous exercise and let W2 =span(a’\a), we have V = W, @W2. 


(b) We may set W2 to be the y-axis and W5 to be {t(1,1):t¢ R}. 


35. (a) Since {w1,u2,...,Un} is a basis, the linear combination of 
{urs41, Uk+2; a tepene Un} 
can not be in span({wj, u2,...,uz}) = W. This can make sure that 


{upsi + W,uri2 + W,...,Un +W} 


is linearly independent by 1.3.31(b). For all u+W ¢ V/W we can 
write 


ut W = (ayuy + aque +++ Gyn) +W 
= (Q,Uz + GgQUg +++ + ApUK) + (GparUgs1 + OppoUKs2 $21 + AnUn) +W 
= (Gg41Uk+1 + Aks2Uk+2 +7 + GnUn) +W 
= Gp+1(Uge1 + W) + Gpso(Ugsg + W) +++ Gn(Un + W) 


and hence it’s a basis. 
(b) By preious argument we have dim(V/W) = n-k =dim(V)-dim(W). 


1.7 Maximal Linearly Independent Subsets 
1. (a) No. For example, the family {(0,n)}n>1 of open intervals has no 
maximal element. 


(b) No. For example, the family {(0,7)}n>1 of open intervals in the set 
real numbers has no maximal element. 


(c) No. For example, the two set in this family {1,2,2,3} are both 
maximal element. 
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(d) Yes. If there are two maximal elements A and B, we have Ac B 
or Bc A since they are in a chain. But no matter Ac Bor Bc A 
implies A = B since they are both maximal elements. 


(e) Yes. If there are some independent set containing a basis, then the 
vector in that independent set but not in the basis cannot be a linear 
combination of the basis. 


(f) Yes. It’s naturally independent. And if there are some vector can not 
be a linear combination of a maximal independent set. We can add 
it to the maximal independent set and have the new set independent. 
This is contradictory to the maximality. 


. Basis described in 1.6.18 is a infinite linearly independent subset. So the 
set of convergent sequences is an infinite-dimensional subspace by 1.6.21. 


. Just as the hint, the set {7,7?,...} is infinite and independent. So V is 
indinite-dimensional. 


. By Theorem 1.13 we can extend the basis of W to be a basis of V. 


. This is almost the same as the proof of Theorem 1.8 since the definition 
of linear combination of a infinite subset 3 is the linear combinations of 
finite subset of 8. 


. Let F be the family of all linearly independent subsets of S2 that contain 
S,. We may check there are some set containing each member of a chain 
for all chain of ¥ just as the proof in Theorem 1.13. So by Maximal 
principle there is a maximal element 6 in ¥. By the maximality we 
know £ can generate Sg and hence can generate V. With addition of it’s 
independence we know (£ is a basis. 


. Let ¥ be the family of all linearly independent subset of 2 such that union 
of it and S is independent. Then for each chain @ of ¥ we may choose 
U as the union of all the members of @. We should check U is a member 
of #. So we should check wether S UU is independent. But this is easy 
since if 7.1 aivi + L32, bjuj = 0 with v; ¢ S and u; € U, say uj « Uj where 
{U;} are a members of @, then we can pick the maximal element, say Ui, 
of {U;}. Thus we have u,; € U; for all 7. So SuU is independent and 
hence a; = 6; = 0 for all 7 and 7. 


Next, by Maximal principle we can find a maximal element S$; of @. So 
SUS} is independent. Furthermore by the maximality of 5S; we know that 
SUS can generate 6 and hence can generate V. This means SUS} isa 
basis for V. 
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Chapter 2 


Linear Transformations and 
Matrices 


2.1 Linear Transformations, Null Spaces, and 


1. 


Ranges 


(a) Yes. That’s the definition. 


(b) No. Consider a map f from C over C to C over C by letting f(x+iy) = 
xz. Then we have f(a1 + iy, + v2 + ty2) = 41 + x2 but f(iy) = 0 #= 
if(y) = ty. 

(c) No. This is right when T is a linear trasformation but not right in 
general. For example, 


T:R-R 


reart+l 


It’s one-to-one but that T(x) = 0 means x = -1. For the counterex- 
ample of converse statement, consider f(a) = |z]. 


(d) Yes. We have T(0v) = T(0x) = OT (Ov )w = Ow, for arbitrary ¢ V. 


(e) No. It is dim(V). For example, the transformation mapping the real 
line to {0} will be. 


(f) No. We can map a vector to zero. 
(g) Yes. This is the Corollory after Theorem 2.6. 
(h) No. If v2 = 221, then T(x2) must be 27 (21) = 2y1. 


2. It’s a linear transformation since we have 


T ((a1, a2, a3)+(b1, be, b3)) = T (a1 +b1, adg+b2, a3+b3) = (a1+by-a2-bo, 2a3+2bs3) 


= (a1 — G2, 2a3) + (b1 — bz, 2b3) = T(a1, a2, a3) + T(b1, be, bs) 
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10. 


. It’s a linear transformation. And N(T) = ( on 


and 

T (cay, Caz, caz) = (c(a1 - a2), 2ca3) = CT (a1, a2, a3). 
N(TL) = {(a1,@1,0)} with basis {(1,1,0)}; R(T) = R? with basis {(1,0), (0, 1)}. 
Hence T is not one-to-one but onto. 


. Similarly check this is a linear transformation. N(T) = {0} with basis @; 


R(T) = {ai(1, 0,2) + a2(1,0,-1)} with basis {(1,0,2),(1,0,-1)}. Hence 
T is one-to-one but not onto. 


2a —4a : 
ae a } with 
a21 422 423 

basis 


RO)=(( 5} with basis (4 ates 5 )} 


. It’s a linear transformation. And N(T) = {0} with basis @; R(T) = 


{ax? + b(a?+1)+ex} with basis {a°?, 27+ 1,2}. Hence T is one-to-one but 
not onto. 


. N(T) is the set of all matrix with trace zero. Hence its basis is { Ej; bie; U 


{Ex = Enn }i=i,2,...,n-1° R(T) = F with basis 1. 


. For property 1, we have T(0) = T(0x) = OT (x) = 0,where «x is an arbitrary 


element in V. For property 2, if T is linear, then T(cat+y) = T(cr)+T(y) = 
cT (x) +T(y); if T(ca+y) = cT(x)+T(y), then we may take c= 1 or y=0 
and conclude that T is linear. For property 3, just take c = —1 in property 
3. For property 4, if T is linear, then 


1S at) =T(a121) + (y aja) = + = YT (ae) = Yar (es) 


if the equation holds, just take n = 2 and a, = 1. 


. Just check the two condition of linear transformation. 


(a) T(0,0) # (0,0). 
(b) T(2(0, 1)) = (0,4) # 2T(0, 1) = (0, 2). 
(c) T(25,0) = (0,0) + 27(5,0) = (2,0). 
(d) 

) 


d) T((1,0) + (-1,0)) = (0,0) + T(1,0) + T(0, 1) = (2,0). 
(ce) T(0,0) + (0,0). 


We may take U(a,b) = a(1,4) + 6(1,1). By Theorem 2.6, the mapping 
must be T’= U. Hence we have T(2,3) = (5,11) and T is one-to-one. 
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11. 


12. 
13. 


14. 


15. 


16. 


Lf 


18. 
19. 


20. 


21. 


This is the result of Theorem 2.6 since {(1,1),(2,3)} is a basis. And 
T(8, 11) = T(2(1, 1) + 3(2,3)) = 27(1, 1) + 37 (2, 3) = (5, -3, 16). 


No. We must have T'(-2,0,-6) = -27(1,0,3) = (2,2) # (2,1). 


Let pea a;v; = 0. Then we have TS QjiVi) = ye a;T(v;) = 0 and this 
implies a; = 0 for all 2. 


(a) The sufficiency is due to that if T(x) = 0, {a} can not be independent 
and hence x = 0. For the necessity, we may assume )\a;T(v;) = 0. 
Thus we have T()a;v;) = 0. But since T is one-to-one we have 
¥ a;v; = 0 and hence a; = 0 for all proper 7. 

(b) The sufficiency has been proven in Exercise 2.1.13. But note that 


S may be an infinite set. And the necessity has been proven in the 
previous exercise. 


(c) Since T is one-to-one, we have T(() is linear independent by the 


previous exercise. And since T is onto, we have R(T’) = W and hence 
span(T(8)) = R(T) = W. 


We actually have T(X".) aix’) = Ny “+ a'*!. Hence by detailed check we 


1=0 G41 
know it’s one-to-one. But it’s not onto since no function have integral= 1. 
Similar to the previous exercise we have T(X".) aiv’) = Niko iaix’ |. It’s 
onto since T(Yj-9 S42"*') = Loa’. But it’s not one-to-one since 
T(1) = T(2) =0. 
(a) Because rank(T) <dim(V) <dim(W) by Dimension Theorem, we 
have R(T) ¢ W. 
(b) Because nullity(T) =dim(V)-rank(T) >dim(V )-dim(W) > 0 by Di- 
mension Theorem, we have N(T) # {0}. 
Let T(2,y) = (y,0). Then we have N(T) = R(T) = {(#,0): a € R}. 


Let T : R? > R? and T(z,y) = (y,x) and U is the identity map from 
R? > R?. Then we have N(T) = N(U) = {0} and R(T) = R(U) = R?. 


To prove A = T(Vj) is a subspace we can check first T(0) = 0 € A. For 
Y1,Y2 € A, we have for some 21,22 € V; such that T(a,) = y, and T(x2) = 
ya. Hence we have T(21 + @2) = y, + y2 and T(ca,) = xy,. This means 
both y; + y2 and cy, are elements of A. 

To prove that B = {x ¢€ V : T(x) € W,} is a subspace we can check 
T(0) =0¢€ W, and hence 0€ B. For 11,22 € B, we have T(x1),T (a2) € Wi. 
Hence we have T(x%1 +22) = T(#1),T (a2) € Wi and T(cx1) = cT (a1) € Wi. 
This means both x; + 22 and cx, are elements of B. 


(a) To prove T is linear we can check 


T(o1 + 02)(n) = 01 (n +1) +00(n +1) = T(o1)(n) + T(o2)(n) 
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22. 


23. 


24. 


25. 


26. 


and 
T(co)(n) = co(n +1) =cT(c)(n). 


And it’s similar to prove that U is linear. 


(b) It’s onto since for any o in V. We may define another sequence T 
such that 7(0) = 0 and t(n +1) = o(n) for all n> 1. Then we have 
T(r) =¢. And it’s not one-to-one since we can define a new oo with 
o0(0) = 1 and oo(n) = 0 for all n > 2. Thus we have og # 0 but 
T (00) =(; 


If T(c)(n) = o(n-1) =0 for all n > 2, we have o(n) =0 for all n> 1. 
And let og be the same sequence in the the previous exercise. We 
cannot find any sequence who maps to it. 


— 
lo) 
Ww 


Let T(1,0,0) = a, T(0,1,0) = 6, and T(0,0,1) =c. Then we have 
T(2,y, z) = £T(1,0,0) + yT (0, 1,0) + 27 (0,0, 1) = ax + by + cz. 
On the other hand, we have T(21,22,...,%n) = @121 + Gq%q +++ GnUn, if 


T is a mapping from F” to F. To prove this, just set T(e;) = a;, where 
{e;} is the standard of F”. 


For the case that 7’: F” > F, actually we have 
n n n 
T (21,22, see Ln) = ex Q1jX5, » a2;X;, seey Se GiB; ) 
j=l j=2 j=m 


. To prove this, we may set T(e;) = (@1;,@27,---,@mj)- 
With the help of the previous exercise, we have 

N(T) = {(2,y, z): ax + by + cz = OF. 
Hence it’s a plane. 


(a) It will be T (a,b) = (0,6), since (a,b) = (0,6) + (a, 0). 
(b) It will be T(a,b) = (0,b-a),since (a,b) = (0,6-a) + (a,a). 


(a) Let W, be the xy-plane and W, be the z-axis. And (a, b,c) = (a,b,0)+ 
(0,0,c) would be the unique representation of W; 6 W2. 


(b) Since (a,b,c) = (0,0,c) + (a,b,0), we have T(a, b,c) = (0,0,c). 
(c) Since (a,b,c) = (a-c,b,0) + (c,0,c), we have T(a, b,c) = (a-c,0,0). 


(a) Since V = Wy ® Wo, every vector « have an unique representation 
©=2%1+ 22 with 2, € W, and x2 € W2. So, now we have 


T(a+cy) =T(21 + 22+ cy + cy2) 


= T((x1 + cy) + (2 + cy2)) = 21 + cy, = T(x) +cT(y). 


29 


27. 


28. 


29. 


30. 


3l. 


32. 


33. 


And hence it’s linear. 

On the other hand, we have « = « + 0 and hence T(x) = x if x € Wy. 
And if x ¢ W,, this means x = 71 + 22 with x2 + 0 and hence we have 
T(x) =a, #2142. 

(b) If x1 € W, then we have T(21 + 0) = 21 € R(T); and we also have 
R(T) c Wy. If x2 € W2 then we have T(x2) = T(0+22) = 0 and hence 
x2 € N(T); and if « € N(T), we have x = T(x) +” =0+ and hence 
LE Wo. 

(c) It would be T(x) = x by (a). 

(d) It would be T(x) = 0. 


(a) Let {v1,v2,...,v~} be a basis for W and we can extend it to a basis 
GB = {v1,V2,...,Un} of V. Then we may set W’ =span({vp41, Ue+2;---;Un})- 
Thus we have V = W@W’ and we can define T be the projection on 
W along W’. 


(b) The two projection in Exercise 2.1.24 would be the example. 


We have T(0) = 0€ {0}, T(x) € R(T), T(x) =O0€ N(T) if x € N(T) and 
hence they are T-invariant. 


For x,y ¢ W, we have x+cy € W since it’s a subspace and T(),T(y) « W 
since it’s T-invariant and finally T(# + cy) = T(x) +cT(y). 


Since T(x) € W for all x, we have W is T-invariant. And that Ty = Iw is 
due to Exercise 2.1.26(a). 


(a) Ifae W, we have T(x) € R(T) and T(x) € W since W is T-invariant. 
But by the definition of direct sum, we have T(x) ¢ R(T) W = {0} 
and hence T(x) = 0. 

(b) By Dimension Theorem we have dim(N(T)) =dim(V)-dim(R(T)). 
And since V = R(T) ® W, we have dimW =dim(V)-dim(R(T)). In 
addition with W c N(T) we can say that W = N(T). 

(c) Take T be the mapping in Exercise 2.1.21 and W = {0}. Thus W # 
N(T) = {(a1,0,0,...)}. 


We have N(Tw) ¢ W since Tw is a mapping from W to W. For «¢« W 
and « € N(Tw), we have Tw(«) = T(x) = 0 and hence xe N(T). For the 
converse, if « ¢ N(T) NW, we have x € W and hence Ty(x) = T(x) = 0. 
So we’ve proven the first statement. For the second statement, we have 


R(Tw) = {ye W : Tw(2) =y,0€W} = {Tw(2): 2¢ W} = {T(2): ce WH. 


It’s natural that R(T) >span({T(v) : v € G}) since all T(v) is in R(T). 
And for any y¢ R(T) we have y= T(x) for some x. But every « is linear 
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combination of finite many vectors in basis. That is, x = eee a,v; for some 
vu; € 8. So we have 


k k 
y = ry. a,j) = », a;T (vj) 
i=l i=l 
is an element in span({T(v) : vu € {}). 


34. Let T be one linear transformation satisfying all conditions. We have that 
for any x € V we can write x = ya a;v; for some vu; € 8. So by the 
definition of linear transformation we have T(x) must be 


k k k 
T(d) avi) = YF aT (44) = Ya f(r) 
i=l i=1 i=l 
,a fixed vector in W. So it would be the unique linear transformation. 
35. (a) With the hypothesis V = R(T) + N(T), it’s sufficient to say that 
R(T) N(T) = {0}. But this is easy since 
dim( R(T) n N(T)) = dim( R(T)) + dim(N(T)) - dim( R(T) + N(T)) 
= dim(R(T)) + dim(N(T)) - dim(V) = 0 
by Dimension Theorem. 
(b) Similarly we have 
dim( R(T) + N(T)) = dim( R(T)) + dim(N(T)) - dim(R(T)n N(T)) 
= dim(R(T)) + dim(N(T)) - dim({0}) = dim(V) 
by Dimension Theorem. So we have V = R(T) + N(T). 
In this case we have R(T) = V and N(T) = {(a1,0,0,...)}. So 
naturally we have V = R(T)+N(T). But V is a direct sum of them 
since R(T) n N(T) = N(T) # {0}. 
(b) Take T; = U in the Exercise 2.1.21. Thus we have R(T\) = {(0, a1, a2,...)} 


and N(T;) = {(0,0,...)}. So we have R(T,) n N(T,) = {0} but 
R(T;) - N(T1) = R(T;) #V. 


37. Let c= $ €Q. We have that 


36. (a 


wa 


1 1 1 1 
T(a) = P52 + a2 feet 5”) = br(; 2) 
—————— 

b times 


and hence T(x) = T(x). So finally we have 


i 24 

T (cx) = T(;e) =T(;2+> 

—$—$_$—$— 
a times 


+ ie 
x 5e 


aT(>) = 52 (2) =cT(z). 
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38. It’s additive since 


T (21+ iyi) + (%2 + tye)) = (41 + £2) -i(y1 + y2) = T (a1 +ty1) + T (x2 + tye). 
But it’s not linear since T(i) = -i # iT(1) = 0. 
39. It has been proven in the Hint. 
40. (a) It’s linear since 
n(ut+v) =(utv)+W =(ut+W)+(v+W) =n(u) + 7(v) 
and 
n(cv) =cu+ W =c(v+ W) =cn(v) 


by the definition in Exercise 1.3.31. And for all element v+ W in 
V/W we have n(v) = u+ W and hence it’s onto. Finally if n(v) = 
u+W =0+W we have v-0=veW. Hence N(n) = W. 


(b) Since it’s onto we have R(T) = V/W. And we also have N(7) = W. 
So by Dimension Theorem we have dim(V) =dim(V/W)+dim(W). 


(c) They are almost the same but the proof in Exercise 1.6.35 is a special 
case of proof in Dimension Theorem. 


2.2 The Matrix Representation of a Linear Trans- 
formation 


(a) Yes. This is result of Theorem 2.7. 
(b) Yes. This is result of Theorem 2.6. 
) 

( 


(f) No. A transformaion of £(V,W) can not map element in W in 
general. 


2. (a) We have T(1,0) = (2,3,1) = 2(1,0,0) + 3(0,1,0) + 1(0,0,1) and 
T(0,1) = (-1,4,0) = -1(1,0,0) + 4(0, 1,0) + 0(0,0,1). Hence we get 
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(g) 


3. Since 


we have 


and 


1 

10 + 0 
u-(3 2. 

ft $0). 20 

0 1 
m-(" || 

1 0 
[T]}=(1 0 a 


1 2 
£410) = CL, 1,2) > “3 Ul 1,0) + 0(0, 1, 1) + 3 (212.3) 
T(0,1) = (-1,0,1) = -1(1, 1,0) + 1(0, 1, 1) + 0(2, 2,3) 
T(1,2) = (-1,1,4) = -2(1,1,0) +2(0,1,1) + (22,3) 


11 4 
T(2,3) = ( 1,2,7) = 3 CLA, 0) S00.) Mya, (25258) 
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4. Since 


we have 


(f) 


ae 


4 


4 


NG es | eS 


; p )=Le ae #00" 
; y Le aes t0% 
; ) ) =O Be + ox" 
:  ) =O Be #00" 


mS 
[ey 
8 
U 
ooor 
oHroOOo 
SCOrFO 
i) 


4 
Al 
ys) 

U 
cone 
SCONH 
wNoOoNnNoO 


1 
Ale) 
4 
3 
[F(@)]s | -6 ; 
1 
[a], = ( a ) 


6. It would be a vector space since all the condition would be true since they 
are true in V and W. So just check it. 
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7. 


10. 


11. 


12. 


13. 


If we have ({7']})i; = Aij, this means 


T(v;) = a Ajj Wij. 
t=1 


And hence we have a 
aT (vj) = > aAyjui. 
i=l 
and thus (a[T] 3 is a aAj;. 


. If B= {v1,v2,...,Un} and x = Yi, av; and y = Yi, b;v;, then 


T(x+cy) = T(>) ayvjte >> byv;) = (a1teb1, ag+cb2,...,an+cbn) = T(x) +cT(y). 
i=l i=l 


. It would be linear since for c ¢ R we have 


T((a1+ty1)+c(@2+iy2)) = (ay +cx2)+i(xotcy2) = T(x, +1y,)+cT (xotiy2). 


And the matrix would be 


It would be 
1 1 O 0 
0 1 1 : 
0 0°: 0 
: moot 
Ot sek ct. OE 


Take {v1, v2,...,Ux~} bea basis of W and extend it to be 6 = {v1, v2,...,Un}, 
the basis of V. Since W is T-invariant, we have T(v,;) = oy ayjv; if 7 = 
1,2,...,k. This means ([T]g),; = Oifj =1,2,...,kandi=k+1,k+2,...,n. 


Let {v1,02,..-,Ue} and {Up41, Ue+2,---,Un} be the basis of W and W’ 
respectly. By Exercise 1.6.33(b), we have 3 = {v1,v2,...,Un} is a basis of 


V. And thus we have 
_{ Ik O 
[T]e - ( OO 


is a diagonal matrix. 


Suppose, by contradiction, that cT = U for some c. Since T is not zero 
mapping, there is some x € V and some nonzero vector y € W such that 
T(x) = y #0. But thus we have y = +cy = 4U(x) = U(4a) € R(U). This 
means y € R(T) n R(U), a contradiction. 
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14. 


15. 


16. 


It can be checked that differentiation is a linear operator. That is, T; is 
an element of £(V) for all i. Now fix some n, and assume 7), aiTj = 
0. We have T;(«”) = ene and thus {7;(x”)}i<1,2,..... would be an 
independent set. Since 1°7., a;T;(x”) = 0, we have a; = 0 for all i. 


(a) We have zero map is an element in S°. And for T,U ¢ 9°, we have 
(T+ cU)(x) = T(x) +cU(x) =O if re S. 

(b) Let T be an element of S$. We have T(x) = 0 if x € 9; c Sq and 
hence T is an element of S$). 

(c) Since V,; + Vo contains both V; and V2, we have (V; + V2)° c V2 An Vy 
by the previous exercise. To prove the converse direction, we may 
assume that T « V2 n V3’. Thus we have T(x) = 0 if x ¢ Vi or 
ve Vy. For z=utve Vy + Vo with ue Vy and v € Vo, we have 
T(z) =T(u) + T(v) =0+0=0. So T is an element of (V; + V2)° and 
hence we have (Vi + V2)° 3 VP A Vey. 


As the process in the proof of Dimension Theorem, we may pick the same 
basis 6 = {v1,V2,...Un} for V. Write ugs1 = T(ve+i1- It has been proven 
that {up41, Up+2;--+;Un} is a basis for R(T). Since dim(V) =dim(W), we 
can extend to a basis y = {t1,U2,...,Un} for W. Thus we have 


{OO 
a(G a) 


is a diagonal matrix. 


2.3. Composition of Linear Transformations and 


1. 


Matrix Multiplication 


(a) It should be [UT]} = [T]2[UJ}. 

(b) Yes. That’s Theorem 2.14. 

(c) No. In general ( is not a basis for V. 
(d) Yes. That’s Theorem 2.12. 

(e) No. It will be true when 8 =a. 


1 0 


(g) No. T is a transformation from V to W but L,4 can only be a 
transformation from F” to F”. 


2 
(f) No. We have ( - a =I. 


2 
0 1 
(h) No. We have 0 0 a. 


(i) Yes. That’s Theorem 2.15. 
j) Yes. Since 6;; = 1 only when 7 = 7, we have A;; = 6,7. 
J j y J J J 
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3. 


A(2B +3C) = & Pe a 


10 


(AB)D = A(BD) = ( ee 


8 


—26 


1 1 0 
(a) We can calculate that [U]} = | 0 0 1 | and [T], 
1 0 


and finally 


(b) We can calculate [h(x)]g = | -2 and 


[U(h(2)) ]o = TD 


1 
- (a) med 


a 
6 
-6 
[T(F(2))]a a : 
6 


(c) [T(A)], 
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) 


) 


; 


5. 


10. 


(d) [T(F(z))]y=( 12 ). 
(b) We have 


(a(AB))ig =a) Ain Brj 
k=1 


n 


> @Ain Bas = ((aA)B) a 


k=1 


Me 


AjxaByj = (A(aB))ij 


k=1 


(d) We have [I(v;)]a = e; where v; is the i-th vector of 3. 
Corollary. We have by Theorem 2.12 


k k k 
A()> a; B;) a >> A(aiBi) = >) a; AB; 
1=1 1=1 1=1 


and ‘ ' ; 
i=1 i=1 i=1 


. We have (Be;); = Yyp-1 Biz (e;)i = Bij, since (e;); = 1 only when i = j and 


it’s 0 otherwise. 


. (c) Just check that for all vector v ¢ F” we have L4.p(v) = (A+ B)v = 


Av+ Bu=La(v)+ Le(v) and Laa(v) = aA(v) = aL a(v). 
(f) For all vector v € F” we have Ly, (v) = In(v) =v. 


. In general we may set T,,T> ¢ £L(X,Y) and U;,U2 « L(W,X), and S « 


(V,W), and thus we have the following statements. 


T,(U, + Uz) = TU, + TU2 and (U, + U2)T = U,T + UoT. 
T\(U,S) =(TU1)S. 

TIx =lyT=T. 

a(T,U1) = (aT,)U, = T\(aU;) for all scalars a. 


(a 
(b 
(c 
(d 


Ney NI Ne NES 


To prove this, just map arbitrary vector in domain by linear transforma- 
tions and check whether the vectors producted by different transforma- 
tions meet. 


0 1 0 0 


Take A= Dd ) ana B=( Eo J and U = and T= Lo, 


If A is a diagonal matrix then A;; + 0 only when i = 7. Hence we have 
Aj; = 6;;A;;. Conversely, if A is not diagonal, we can find A,; # 0 for some 
a9 and i# I: Thus we have 655 Aaj =0# Ajj. 
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11. If T? = T we may pick y ¢ R(T) and thus we have y = T(x) for some 
x and T(y) = T(T(a)) = T?(x) = 0. Hence we conclude that y « N(T). 
Conversely if we have R(T) c N(T), we have T?(x) = T(T(x)) = 0 since 
T(a) is an element in R(T’) and hence in N(T). 


12. (a) If UT is injective, we have that UT(x) = 0 implies x = 0. Thus 
we have that if T(a2) = 0 we also have UT(x) = 0 and hence z = 0. 
So T is injective. But U may not be injective. For example, pick 
U(2,y,z) =(2,y), a mapping from R® to R?, and T(z, y) = (2, y,0), 

a mapping from R? to R°. 
(b) If UT is surjective, we have that for all z € Z there is a vector ¢V 
such that UT(x) = z. Thus we have that if for all z ¢ Z we have 
z = U(T(x)) and hence U is surjective. But T may not surjective. 
The example in the previous question could also be the example here. 


For all z € Z, we can find z = U(y) for some y € W since U is surjective 
and then find y = T(x) for some x € V since T is surjective. Thus 
we have z = UT(x) for some x and hence UT is surjective. On the 
other hand, if UT(x) = 0, this means T(x) = 0 since U is injective 
and x = 0 since T is injective. 


— 
lo) 
WN 


13. It’s natural that we have tr(A) =tr(A‘) since A;; = Af, for all i. On the 
other hand we have 


i=1 i=l k=1 
=> > BriAiz = >> (BA)kx 
k=1ik=1 k=1 
= tr(BA) 
14. (a) We can write 
a1 05 Bij 
Beu| 3-2 47B23 
21 95 Bj 
Bi; 
P F P 
= 045 eB = 7450}. 
L L 
J Be J 


(b) This is the result of Theorem 2.13(b) and the previous exercise. 
(c) This is instant result of the fact that wA = Aw. 
(d) This is also because AB = B‘ A’. 
15. Let v4, be the t-th column vector of A and we have v; = Dtej AtUt- Thus we 


have Mv; = Y44;a+Mv;. And hence we get the desired result since Muy 
is the column vector of MA. 
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16. 


(a) Since we know R(T) is a T-invariant space, we can view T as a 


a 


mapping from R(T) to R(T) and call this restricted mapping T| pr). 
So now we have that 


dimR(T) = rank(T) = rank(T”) = dim(T(T(V)) 


= dim(T(R(T)) = rank(T|,(r)). 


And so the mapping T|pcr) is surjective and hence injective with 
the help of the fact R(T) is finite dimensional. This also means 
N(T|acr) = R(L) 9 N(T) = 0. This complete the proof of the first 
statement. For the other, it’s sufficient to say that R(T)+N(T) =V. 
But this is instant conclusion of the fact that R(T) + N(T) c V and 
that 


dim(R(T) + N(T)) = dim(R(T)) + dim(N(T)) - dim(R(T) n N(L)) 
= dim(R(T)) + dim(N(T)) = dim(V). 


In general we have rank(T'**!) <rank(T*) since the fact T**'(V) = 
T*(R(T)) c T*(V). But the integer rank(7*) can only range from 0 
to dim(V). So there must be some integer k such that 


rank(T") = rank(T**"). 


And this means T**1(V) = T*(V) and hence T*(V) = T*(V) for all 
s>k. Since 2k >k, we can conclude that rank(T") =rank(T?") and 
hence we have V = R(T") @ N(T*) by the previous exercise. 


17. If T = T?, then we have V = {y: T(y) = y} + N(T) since x = T(x) + (a - 


T(a)) and we have T(T(a)) = T(x) and T(x - T(a)) = T(x) -T(a) =0. 
On the other hand, we have if « ¢ {y: T(y) = y}nN(T) then we also have 
x = T(«) =0. So by arguments above we have V = {y: T(y) =y} ® N(T). 
Finally we have that T must be the projection on W; along W2 for some 
W, and W2 such that W, @ Wo =V. 


18. Let A, B, and C be mxn, nx p, and px q matrices respectly. Next we 


want to claim that (AB)C = A(BC) since 


((AB)C) i; = 2 (AB) ixCry = 3 (> AuBir)Crj 


k=1 k=1 [=1 
Pon n Pp 

=> Y AuBuCrs = >> YS AuBurCr; 
k=l l=1 i=l k=1 


Sy 


BirCrj) = > Au( BC) = (A(BC))i5- 
Il) kel isl 


For the following questions, I would like to prove them in the languague 
of Graph Theory. So there are some definitions and details in Appendices. 
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19. 


20. 


21. 


22. 


23. 


Let G = G(B) be the graph associated to the symmetric matric B. And 
(B?),; is the number of walk of length 3 from i to i. If i is in some clique, 
then there must be a walk of length 3 from 7 back to 7 since a clique must 
have number of vertex greater than 3. Conversely, if (B?),; is greater than 
zero, this means there is at least one walke of length 3 from i to 7, say 
i>j—2>k-—i. Note that 7, 7, and & should be different vertices since 
length is 3 and there is no loop. So 7, 7, and k must be a triangle, this 
means three vertices adjacent to each others. So 7 is contained in {7, 7, k} 
and so contained in some clique. 


We can draw the associated digraph and find the cliques as follow: 


1 2 1 2 


(a) (b) 


(a) There is no clique. 
(b) The only clique would be the set {1,3, 4}. 


A vertex v in a tournament is called a king if v can reach all other vertices 
within two steps. That is, for all vertex u other than v, we have either 
v>uorv>w = u for some w. So (A+ A”);; > 0 is equivalent to that 
7 can reach j within two steps. And the statement of this question also 
means that every tournament exists a king. 


To prove this statement, we can begin by arbitrary vertex v1. If vi is a 
king, then we’ve done. If v, is not a king, this means v; can not reach 
some vertex, say v2, within two steps. Now we have that d* (v2) > d* (v1) 
since we have vg > vy and that if v; > w for some w then we have v2 > w 
otherwise we'll have that v; > w > v2. Continuing this process and we 
can find d*(v,) < d*(v2) <--- and terminate at some vertex vz since there 
are only finite vertces. And so vu; would be a king. 


We have G = G(A) is a tournament drawn below. And every vertex in 
this tournament could be a king. 


The number of nonzero entries would be the number of the edges in a 
tournament. So it would be n(n —- 1)/2. 


Al 


2.4 Invertibility and Isomorphisms 


1. 


ID oo ow 


(a) No. It should be ([T]8)7? = ama ee 
(b) Yes. See Appendix B. 

(c) No. £4 can only map F” to F™. 
(d) No. It isomorphic to F°. 

(e) Yes. This is because P,(F) 2 F”. 


1 0 0 


(f) No. We have that ( 010 


1 0 
\(: | Yt and 
0 0 


invertible since they are not square. 


(g) Yes. Since we have both A and (A7')7! are the inverse of A~', by 
the uniqueness of inverse we can conclude that they are the same. 


(h) Yes. We have that L4-1 would be the inverse of Ly. 
Yes. This is the definition. 


No. They have different dimension 2 and 3. 


) 
) 
(a) 
(b) No. They have different dimension 2 and 3. 
) 
) 
) 


ai 4 1 1 1 
Yes. T”*(a1, 2,43) = (—3a2 + 343, @2,—541 — 2a2 + 543). 


No. They have different dimension 4 and 3. 
(e) No. They have different dimension 4 and 3. 


(f) Yes. | 2) : as } 


No. They have different dimension 3 and 4. 


) 

b) Yes. They have the same dimension 4. 
) Yes. They have the same dimension 4. 
) 


(d) No. They have different dimension 3 and 4. 


. This is because that (B~'A7')(AB) = (AB)(B1A™) =I. 
. This is because that (A7')' At = (AA7!)* = I and A*(A™!)' = (ATA) =. 
. If A is invertible, then A~! exists. So we have B = A-1AB=A10=0O/ 


(a) With the result of the previous exercise, if A is invertible we have 
that A=O. But O is not invertible. So this is a contradiction. 


(b) No. If A is invertible then B = O by the previous exercise. 


. For Corollary 1 we may just pick W = V and a= (. For Corollary 2 we 


may just pick V =F” and use Corollary 1. 
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9. 


10. 


11. 


12. 


13. 


If AB is invertible then Lg is invertible. So L4lg = Lag is surjective 
and injective. And thus Ly, is injective and Lg surjective by Exercise 
2.3.12. But since their domain and codomain has the same dimension, 
actually they are both invertible, so are A and B. 


(a) Since AB = I, is invertible, we have A and B is invertible by the 
previous exercise. 


(b) We have that AB = I, and A is invertible. So we can conclude that 
A= AT, = AT AB=B, 


item Let T is a mapping from V to W and U is a mapping from W 
to V with dimW =dimV. If TU be the identity mapping, then both 
T and U are invertible. Furthermore T~! = U. 


To prove this we may pick bases a of V and 6 of W and set A= [T]® 


and B =[U]%. Now apply the above arguments we have that A and 
B is invertible, so are T and U by Theorem 2.18. 


If T(f) = 0 then we have that f(1) = f(2) = f(3) = f(4) = 0, then we have 
that f is zero function since it has degree at most 3 and it’s impossible to 
have four zeroes if f is nonzero. 


We can check ¢g is linear first. For x = ))7., ajv; and y = Y7_, b;u;, where 


B = {v1,02,.--,Un} 
we have that 
a, + cby ay by 
dala rey)=| PLM Jat & Jae] % |= os(x) +edaCu). 
Qn ri chy, the be 
0 
And we can check whether it is injective and surjective. If ¢g(x) = : 
0 
ay 
then this means x = )77_, Ov; = 0. And for every ve +c in F”, we have 
an 


that x2 = Yj, a;1; will be associated to it. 


First we have that V is isomorphic to V by identity mapping. If V is 
isomorphic to W by mapping T, then T~! exist by the definition of iso- 
morphic and W is isomorphic to V by T~?. If V is isomorphic to W by 
mapping T and W is isomorphic to X by mapping U, then V is isomorphic 
to X by mapping UT. 
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14. Let 
1 1 0 1 0 0 
e-(oo}(o o}(o sp 
be the basis of V. Then we have that @g in Theorem 2.21 would be the 
isomorphism. 


15. We have that 7 is isomorphism if and only if that T is injective and 
surjective. And we also have that the later statement is equivalent to 
T(G) is a baiss for W by Exercise 2.1.14(c). 


16. We can check that ® is linear since 
®(A+cD) = B'(A+cD)B = B'(AB+cDB) 


= B'AB+cB'DB = ®(A) +c®(D). 
And it’s injective since if ®(A) = B™!AB =O then we have A = BOB™! = 
O. It’s also be surjective since for each D we have that 6(BDB™') = D. 


17. (a) If y1,y2 € T(Vo) and yy = T(x1), yo = T (a2), we have that y1 + y2 = 
T(a1 +22) € T(Vo) and cy; = T(cv,) = T(Vo). Finally since Vo is a 
subspace and so 0 = T(0) € T(Vo), T(Vo) is a subspace of W. 


(b) We can consider a mapping T” from Vo to T'(Vo) by T’(x) = T (a) for 
all x € Vo. It’s natural that T’ is surjective. And it’s also injective 
since T is injective. So by Dimension Theorem we have that 


dim(Vo) = dim(N(T’)) + dim(R(T’)) = dim(T(V)). 


18. With the same notation we have that 


010 0 
Lagg(p(2))=] 0 0 2 0 
000 3 


al Oa 
Il 
—x_~_ 


and 


i 
byT (p(x)) = (1 + 4x + 327) = | 4 
3 


So they are the same. 


19. (a) It would be 


ooo rF 
orooe 
oOoroO 
poo © 
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(b) We may check that 


Lada 5 : )- 


ooo Fe 
oroo°e 
O20.) 
ee = ee es 
Ce Oe 
BPN We 


and 
1 
eo ines: 3 
ae i }=e0( 3 i} 5 
4 


So they are the same. 


20. With the notation in Figure 2.2 we can prove first that ¢,(R(T)) = 


21. 


La(F"). Since ¢g is surjective we have that 
Da(F") = Ladp(V) = 6,T(V) = 6,(R(T)). 


Since R(T) is a subspace of W and ¢, is an isomorphism, we have that 
rank(T’) =rank(Z4) by Exercise 2.4.17. 


On the other hand, we may prove that ¢g(N(T)) = N(La). If y «€ 
og(N(T)), then we have that y = ¢g(x) for some « ¢ N(T) and hence 


Daly) = La(Ga(2)) = 6yT (2) = 6,(0) = 0. 


Conversely, if y ¢ N(Za), then we have that La(y) = 0. Since ¢¢ is 
surjective, we have y = ¢3(x) for some x € V. But we also have that 


oy(T(«)) = Lala ()) = La(y) = 0 


and T(x) =0 since @y is injective. So similarly by Exercise 2.4.17 we can 
conclude that nullity(T) =nullity(Z,). 


First we prove the independence of {T7;;}. Suppose that ); ; aijTij = 0. 
We have that 


ex aij liz) (Ue) = > ai; Tix (ve) = YD ainw; =0. 
4g 4 a 


This means a;x, = 0 for all proper i since {w;} is a basis. And since k is 
arbitrary we have that a;z, = 0 for all z and k. 


Second we prove that [T;;]} = M’’. But this is the instant result of 
Tij (vj) = W; 
and 
T,; (vz) = 0 
for k + j. Finally we can observe that ®(3) = y is a basis for Mmxn(F) 
and so ® is a isomorphism by Exercise 2.4.15. 
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22. It’s linear since 
T(f +g) = ((f + eg) (co), (f + eg)(e1),--- (Ff +.c9)(en)) 
= (f(co) + cg(co), Fler) + eg(er),--- f(r) + egen)) = T(f) + eT (9). 


Since T(f) = 0 means f has n+1 zeroes, we know that f must be zero func- 
tion( This fact can be proven by Lagrange polynomial basis for P,,(F).). 
So T is injective and it will also be surjective since domain and codomain 
have same finite dimension. 


23. The transformation is linear since 


T(a+cr) = XG +er)(i)a* 


= Dati! +cr(i)z" = T(o)+cT(r), 


where m is a integer large enough such that o(k) = r(k) = 0 for all k > m. 
It would be injective by following argument. Since T(o) = 7.) 0(i)x’ = 0 
means o(7) = 0 for all integer 7 < n, with the help of the choice of n we can 
conclude that o = 0. On the other hand, it would also be surjective since 
for all polynomial )°7) a;2° we may let o(z) = a; and thus T will map o 

to the polynomial. 
24. (a) Ifv+ N(T) =v'+ N(T), we have that v—v’ ¢ N(T) and thus T(v) - 

T(v') =T(v- 0") = 0. 
(b) We have that 


T((v+ N(T)) +c(u+ N(T))) =T((v+ cu) + N(T)) 
=T(v+cu) =T(v) +cT(w). 


(c) Since T is surjective, for all y€ Z we have y = T(x) for some x and 
hence y = T(x+N(T)). This means T is also surjective. On the other 
hand, if T(x + N(T)) = T(a) = 0 then we have that 2 ¢ N(T) and 
hence x + N(T) =0+ N(T). So T is injective. With these argument 
T is an isomorphism. 


(d) For arbitrary x ¢ V, we have 
T, (2) =T (2+ N(T)) =T(z). 


25. The transformation WV would be linear since 


Wfteg)= Di (fteg)(s)s= DY fls)steg(s)s 


(f+eg)(s)#0 (f+eg)(s)#0 


= Di f(s)steg(s)s= =D f(s)te DT gs)s 


(f or cg)(s)#0 (f or cg)(s)#0 (f or cg)(s)#0 
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= W(f) +cW(). 


It will be injective by following arguments. If U(f) = Y¢(s)20 f(s)s = 0 
then we have that f(s) = 0 on those s such that f(s) #0 since {s: f(s) #0} 
is finite subset of basis. But this can only be possible when f = 0. On 
the other hand, we have for all element x ¢ V we can write x = ¥}; a;s; for 
some finite subset {s;} of S. Thus we may pick a function f sucht that 
f(s;) = a; for all i and vanish outside. Thus © will map f to xz. So W is 
surjective. And thus it’s an isomorphism. 


2.5 The Change of Coordinate Matrix 


1. (a) No. It should be [2j]. 

(b) Yes. This is Theorem 2.22. 

(c) Yes. This is Theorem 2.23. 

(d) No. It should be B=Q™'AQ. 

(e) Yes. This is the instant result of the definition of similar and Theorem 
2.23. 


2. For these problem, just calculate Lie. 


@ (op). 


AT 


Or 


. Let @ be the standard basis ( 


and , 
[T]e = [13 [(T]elz]5, 
Dy 2 2 1 <1 
“(4 1 es aN a 
(8 1 
(59) 
. We have that 


and 


Nie 
| Nike 
Nie 
a: —— 
ae 
o°oO 
ee © 
—— 
as 
ao 
Po 
—_ 
—— 


of F? or F?). We have that A = [La]a 
and hence [L4]g = [Te [Lalo[1]¢. So now we can calculate [L4]g and 
Q =[1]j and Q™* = [J]a. 


7. We may let 3 be the standard basis and a = {(1,m),(—m,1)} be another 
basis for R?. 


(a) We have that [T]a -( ; and Q-! = [2 -( = : ) We 


also can calculate that Q = [I] m*+1  mit+l |. So finally we 


WR 
ll 
| 


get 


: > x+2ym-a2m? -y+2am+ym? 
That is, T(2,y)=(—4"., so 


1 0 
0 0 


(b) Similarly we have that [T]a = ( ) And with the same @ and 


Q-' we get 


That is, T(a, y) = (22@, 2). 


m24+1?  m?+1 


8. This is similar to the proof of Theorem 2.23 since 
Tel 2 (le a PO 


9. We may denote that A is similar to B by A ~ B. First we have A = 
I-'AI and hence A ~ A. Second, if A ~ B we have A = Q7!BQ and 
B=(Q)'AQ"! and hence B ~ A. Finally, if A~ B and B ~ C then we 
have A = P“'BP and B = Q7'CQ. And this means A = (QP)7'C(QP) 
and hence A~ C. So it’s a equivalence relation. 


10. If A and B are similar, we have A = Q-' BQ for some invertible matrix Q. 
So we have 


tr(A) = tr(Q7"BQ) = tr(QQ"'B) = tr(B) 
by Exercise 2.3.13. 


11. (a) This is because 


(b) This is because 


12. This is the instant result that A =[L,]g and Q defined in the Corollary 
is actually Fale 
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13. 


14. 


Since @ is invertible, we have that Lg is invertible. We try to check f’ is 
an independent set and hence a basis since V has dimension n. Suppose 
that pie a;x, =0. And it means that 


t=1 j= 


245 2, Qi =>) (2, 45 Qi) =0. 
j=l i= 


Since £ is a basis, we have that Viet a;Qi; = 0 for all i. Actually this is a 
system of linear equations and can be written as 


Qu Qi2 “oa Qin 


(a1 ay an ) Qa (22 . an 29020, 
Qni Qn2 oe Qnn 
where v = ( ay ag ... Ay he But since Q is invertible and so Qu! 


exist, we can deduce that v = vQQ™! = 0Q7! = 0. So we know that @ is 
a basis. And it’s easy to see that Q = bale is the change of coordinate 
matrix changing 6’-coordinates into 6-coordinates. 


Let V = F"”, W =F”, T = Ly, 6, and y be the notation defined in the 
Hint. Let 6’ and 7’ be the set of column vectors of Q and P respectly. By 


Exercise 2.5.13 we have that 3’ and y’ are bases and Q = alae P=[I]},. 


Since we have that eae = Bae degra we have that B = P~'AQ. 


2.6 Dual Spaces 


1. 


De 


No. Every linear functional is a linear transformation. 
Yes. It’s domain and codomain has dimension 1. 
Yes. They have the same dimension. 


Yes. It’s isomorphic to the dual space of its dual space. But if the 
“is” here in this question means “equal”, then it may not be true 
since dual space must has that its codomain should be F. 


(e) No. For an easy example we may let T be the linear transformation 
such that T(«;) = 2f;, where 6{x1,22,...,2,} is the basis for V and 
Bt fi, fo,---; fn} is the corresponding dual basis for V*. 


(f) Yes. 
(g) Yes. They have the same dimension. 
(h) No. Codomain of a linear functional should be the field. 


In these question we should check whether it’s linear and whether its 
domain and codomain are V and F respectly. 


50 


(a) Yes. We may check that 
f(p(@) + eq(a)) = 2p"(0) + 2ep’(0) + p"(1) + eq'(1) 
2p'(0) + p"(1) + e(2p"(0) + g"(1)) = F(p(@)) + ef (q(x). 


(b) No. It’s codomain should be the field. 
(c) Yes. We may check that 


tr(A+cB) = > (A+ cB). 


a=1 
= >, Ajj + cB; = tr(A) - ctr(B). 
1=1 


(d) No. It’s not linear. 
(e) Yes. We may check that 


H(p(2) + ea(2)) = f (w(t) + eal t))a 


= [pita +e [ altdat = F((@)) + ef a(x). 
(f) Yes. We may check that 
f(A+ cB) = (A+cB)i = An +cBi = f(A) +ef(B). 
3. (a) We may find out that for all vector (2, y,z) € R? we can express it as 
(@,y.2) = (@- $)(1,0,1) + 5 (1,2,1) + (@-#)(0,0, 1), 
So we can write 


fo(z,y, 2) = $3 
f3(2,y,2) Seis 


filz,y,2) =2- §; 


(b) This is much easier and we have that 


filao + a,x + agx”) = ao; 
fo(ao + a1x + agx”) = a; 
f3(ao + a1@ + G22”) = ag. 


4. We may returen the representation such that 


). 


2 3 1 3.3 #1 1 1 3 
= 2 +(r+y+ +(y-3 
(2,9,2) = (@-2W)(2,->,- Fe (aryen(2, 5, Dyeyaey(d a - 
We may check that the set {(2, x, a) 2, *, aa) (E a *)} is a 


basis and hence the desired set. By Theorem 2.24 {f1, fo, fg} is a basis 
for V*. 
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5. Assume that p(t) = at+bax. We have the (a + bt)dt = a+% and is (a+ bt)dt = 


6. 


7. 


2a+2b. So we may returen the representation such that 


at bu = (a+ 3 )(2- 22) + (2a + 26)(-5 +2). 


We may check that the set {2-2z, -5 +x} is a basis and hence the desired 
set. By Theorem 2.24 {f1, fo, fg} is a basis for V*. 


(a) Calculate directly that 


T'(f) (x,y) = fT (x,y) = f(Bx + 2y, x) = Ta + 4y. 


(b) Since 8 = {(1,0),(0,1)} and (a,y) = 2(1,0) + y(0,1), we have that 


fila, y) =a and fo(x,y) = y. So we can find out that 
T'(fi)(2,y) = fiT (x,y) = fir 3at2y, x) = 3at2y = 3f1(2,y)+2fo(a, y); 
T'(fo)(2,y) r foT(2,y) = fo(3x + 2y, x) == 1fi(z,y) + Ofo(a,y). 


And we have the matrix [T"]g« = ( ; ; i 


(c) Since T(x, y) = (34 + 2y,x), we can calculate that 


and 


So we have that [T"]g+ = ({T])’. 


(a) Calculate directly that 


T'(f)(a+ br) = fT (a+ bx) = f(-a-2b,a+ b) = -3a — 4b. 


(b) Since 6 = {1,r} and a+ br =ax1+bx 2, we have that fi(a+br) =a 


and fo(a+b«) = b. And since y = {(1,0), (0,1)} and (a,b) = a(1,0) + 
b(0, 1), we have that gi(a,b) =a and go(a,b) = b. So we can find out 
that 


T'(g1)(a+ bx) = gi T(a+ bx) = g1(-a- 2b,a +b) = -a- 2b 


= -1x gi(a,b) + (-2) x go(a,b); 
T'(g2)(at br) = g2T (a+ bx) = go(-a-2b,a+b)=a+b 
=1xgqi(a,b) +1 go(a,b). 


iimnommnets a 
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8. 


10. 


(c) Since T(a + bx) = (-a - 2b,a + 6), we can calculate that 


and 


So we have that egal = ((T]3)’. 


Every plane could be written in the form P = {(2,y, z) : ax+by+cz = 0} for 
some scalar a, b and c. Consider a transformation T(2, y, z) = ax + by+cz. 
It can be shown that T is an element in (R*)* and P = N(T). For the 
case in R?, actually every line has the form L = {(z,y) : ax + by = 0} and 
hence is the null space of a vector in (R?)*. 


. If T is linear, we can set f; be g;J' as the Hint. Since it’s conposition of 


two linear function, it’s linear. So we have 
T(x) = (gi(2()), g2(1(@)),-- «5 9m(T(z))) 


= (fil), fo(@), +++; fm(2)). 


have that T(x) = D3") fi(a)ei with f, linear, we can define T;(x) = f;(x)e; 
and it would be a linear transformation in “(F",F™). Thus we know T 
is linear since T’ is summation of all 7;. 


(a) Since we can check that f;(p() + cq(x)) = p(c) + cq(ci) = fi(p(@)) + 
cfi(q(x)), fi is linear and hence in V*. And we know that dim(V*) = 
dim(V) = dim(P,(F)) = n+ 1. So now it’s enough to show that 
{ fo, fi,---,fn} is independent. So assume that )Y"., a; f; = 0 for some 
a;. We may define polynomials p;(x) = T]j2;(a2-c¢j) such that we 
know p;(c;) #0 but p;(c;) = 0 for all 7 #7. So now we have that 


3 ai fi(pi) = a1 fi(pi) =0 
a=1 


implies a; = 0. Similarly we have a; = 0 for all proper 7. 


(b) By the Corollary after Theorem 2.26 we have an ordered basis 


6B = {p0,P1,---,Pn} 


for V such that {f1, fo,.-., fn} defined in the previous exercise is its 
dual basis. So we know that p;(c;) = 6;;. Since 6 is a basis, every 
polynomial in V is linear combination of 3. If a polynomial qg has the 
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property that q(c;) = 60;, we can assume that q = Dj aipi. Then we 
have 


1 = q(co) = > aipi(co) = a1 
i=0 


and . 
O= q(ej) =D) aipi(c) = a; 
i=0 
for all 7 other than 1. So actually we know q = pp. This means 
po is unique. And similarly we know all p; is unique. Since the 
Lagrange polynomials,say {1;}i=1,2,..n, defined in Section 1.6 satisfy 
the property r;(c;) = 6;;, by uniqueness we have r; = p; for all 7. 


(c) Let 6 = {po,p1,.--,Pn} be those polynomials defined above. We may 
check that 


g(x) = Y aipi(2) 


has the property q(c;) = a; for all 7, since we know that p;(c;) = 6;;. 
Next if r(z) € V also has the property, we may assume that 


i=0 
since (6 is a basis for V. Similarly we have that 
ay = r(c;) = >) bipi(ci) = b;. 

i=0 


So we know r= q and q is unique. 

(d) This is the instant result of 2.6.10(a) and 2.6.10(b) by setting a; = 
p(c). 

(e) Since there are only finite term in that summation, we have that the 
order of integration and summation can be changed. So we know 


b b n 
[ p@at= f Yrledpi(e)yat 
a a i=0 


n b 
=f, rleidwi(tae 


. It will be more clearer that we confirm that the domain and codomain of 
both y.2T and Tw. are V and W** respectly first. So for all x « V we 
have 


beT (x) = o(T(x)) = T(x) « W** 
and 
TY (a) = T" (2) 
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12. 


13. 


= (T*)*(@) = 4T* «Ww. 
But to determine whether two elements f and g in W** are the same is 


to check whether the value of f(h) and g(h) are the same for all he W*. 
So let h be an element in W». Let’s check that 


T(x)(h) = A(T(2)) 


and 
&T" (h) = &(hT) = h(T(a)). 


So we know they are the same. 


Let 6 = {a1,%2,...,2%n} be a basis for V. Then we know the functional 
¢,¢«V** means £;(f) = f(a;) for all funtional f in V*. On the other hand, 
we have the dual basis 8* = {fi, fo,..., fn} is defined by fi(x;) = 6;; for 
alli = 1,2,...,n and j = 1,2,...,n such that f; is lineaer. And we can 
further ferret what elements are in 6**. By definition of 6** we know 
Be" ={F\, F,...,F,} and F,(f;) = 6;; and F; is linear. So we may check 
that whether F; = #; by 


£i( fj) = fy(wi) = big = Fi(fy)- 


Since they are all linear functional and the value of them meets at basis 
8, they are actually equal by the Corollary after Theorem 2.6. 


(a) We can check that f +g and cf are elements in S° if f and g are 
elements in S° since (f + g)(x) = f(x) + g(x) = 0 and (cf)(x) = 
cf (a) =0. And the zero function is an element in S°. 

(b) Let {v1,v2,...,0~} be the basis of W. Since « ¢ W we know that 
{v1,V2,...,Ug41 = x} is an independent set and hence we can extend 
it to a basis {v1,v2,...,Un} for V. So we can define a linear trans- 
formation T such that f(v;) = dic¢41). And thus f is the desired 
functional. 


(c) Let W be the subspace span($). We first prove that W° = 9°. Since 

every function who is zero at W must be a function who is zero at 
S. we know W° c $°. On the other hand, if a linear function has 
the property that f(a) = 0 for all x € S, we can deduce that f(y) =0 
for all y¢ W =span(S). Hence we know that W° 5 S° and W® = 9°. 
Since (W°)° = ($°)° and span(7(S)) = 1(W) by the fact w is an 
isomorphism, we can just prove that (W°)° = ~(W). 
Next, by Theorem 2.26 we may assume every element in (W°)° c V** 
has the form # for some x. Let # is an element in (W°)°. We have 
that £(f) = f(x) = 0 if f « W°. Now if x is not an element in W, 
by the previous exercise there exist some functional f ¢« W° such 
that f(a) #0. But this is a contradiction. So we know that @ is an 
element in U(W) and (W°)® c ¥(W). 
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14. 


15. 


16. 


For the converse, we may assume that # is an element in 7)(W). Thus 
for all f ¢ W° we have that #(f) = f(x) =0 since x is an element in 
W. So we know that (W°)° 5 7(W) and get the desired conclusion. 


(d) It’s natural that if W, = W2 then we have W? = W3). For the converse, 
if WP = WS then we have 


&(Wi) = (Wy)? = (W3)° = v2) 


and hence 
W, = 0 '(W1) = Yd (Wa) = We 
by the fact that w is an isomorphism. 


(ec) If f is an element in (W; + W2)°, we have that f(w1 + we) = 0 for 
all w; € W, and we € Wa. So we know that f(wi +0) = 0 and 
f(0+we2) = 0 for all proper w; and wa. This means f is an element in 
W?nW3. For the converse, if f is an element in W?n W’, we have 
that f(wr + we) = f(wi) + f(we2) = 0 for all Wi € Wi and We € Wo. 
Hence we have that f is an element in (W1 + W2)°. 


We use the notation in the Hint. To prove that a = { frit, fe+2,---,fn} 
is a basis for W°, we should only need to prove that span(a) = W® since 
by ac 8* we already know that a is an independent set. Since W° c V*, 
every element f € W?° we could write f= afi. Next since for 1 <i<k 
x; is an element in W, we know that 


0= f(a) = Yafile) = Qj. 


So actually we have f = 37,4; aif; is an element in span(a). And finally 
we get the conclusion by 


dim(W) + dim(W 9) = &+ (n-k) =n=dim(V). 


If T'(f) = fT = 0, this means f(y) = 0 for all y ¢ R(T) and hence f «€ 
(R(T))°. If f ¢ (R(T))°, this means f(y) = 0 for all y ¢ R(T) and hence 
T'(f)(x) = f(T(2)) = 0 for all x. This means f is an element in N(T"). 


We have that 
rank(L4) = dim(R(L4)) = m- dim(R(La)°) = m- dim(N((La)‘)) 
= dim((F”)*) - dim(N((La)’)) = dim(R((La)')). 


Next, let a, 6 be the standard basis for F” and F”. Let a*, 6* be their 
dual basis. So we have that [L4)']. = ([L4]@)* = A‘ by Theorem 2.25. 
Let ¢g+ be the isomorphism defined in Theorem 2.21. We get 


dim(R((L.a)‘)) = dim(@g+(R((La)‘))) = dim(R(L4+)) = rank(L4¢). 
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18. 


19. 


20. 


If W is T-invariant, we have that T(W) c W. Let f be a functional in 
W°®. We can check T"(f) = fT is an element in W® since T(w) « W by 
the fact that T-invariant and thus f(T(w)) = 0. 


For the converse, if W° is T‘-invariant, we know T*(W°) c W®. Fix one 
w in W, if T(w) is not an element in W, by Exercise 2.6.13(b) there exist 
a functional f « W° such that f(T(w)) #0. But this means T‘(f)(w) = 
fT(w) #0 and hence T*(f) ¢ W°. This is a contradiction. So we know 
that T(w) is an element in W for all w in W. 


First check that ® is a linear transformation by 


O(f +cg)(s) = (f +eg)s(s) = fs(s) + cgs(s) = (®(f) + c®(g))(s). 
Second we know ® is injective and surjective by Exercise 2.1.34. 


Let S’ is a basis for W and we can extend it to be a basis S for V. Since 
W is a proper subspace of V, we have at least one element t € S' sucht that 
t¢W. And we can define a function g in F(S,F) by g(t) = 1 and g(s) =0 
for all s ¢ S. By the previous exercise we know there is one unique linear 
functional f ¢ V* such that fg = g. Finally since f(s) = 0 for all s € S” we 
have f(s) =0 for alls¢ W but f(t) =1. So f is the desired functional. 


(a) Assume that T is surjective. We may check whether N(T") = {0} or 
not. If T’(f) = fT = 0, we have that f(y) = f(T(2)) =0 for all ye W 
since there exist some x € V such that T(x) = y. For the converse, 
assume that 7" is injective. Suppose, by contradiction, R(T) + W. 
By the previous exercise we can construct a nonzero linear functional 
f(y) « W* such that f(y) = 0 for all ye R(T). Let fo be the zero 
functional in W*. But now we have that T"(f)(x) = f(T(a)) =0= 
T'(g)(x), a contradiction. So T must be surjective. 

(b) Assume that 7” is surjective. Suppose, by contradiction, T(x) = 0 for 
some nonzero x € V. We can construct a nonzero linear functional 
g « V* such that g(x) # 0. Since T* is surjective, we get some 
functional f ¢ W* such that T'(f) = g. But this means 


O= f(T(x)) = T"(f)(z) = g(x) + 0, 

a contradiction. 

For the converse, assume that T is injective and let S' is a basis for V. 
Since T is injective, we have T(S) is an independent set in W. So we 
can extend it to be a basis S’ for W. Thus for every linear functional 
g¢V* we can construct a functional f ¢ W* such that T’(f) = g by 
the argument below. First we can construct a function h € F(S,F) 
by A(T(s)) = g(s) for s € S and A(t) = 0 for all te S‘\T(S). By 
Exercise 2.6.18 there is a lineaer functional f ¢ W* such that fg: = h. 
So now we have for all se S 


g(s) = A(T(s)) = f(T(s)) = T'(F)(s). 
By Exercise 2.1.34 we have g = T*(f) and get the desired conclusion. 
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2.7 Homogeneous Linear Differential Equations 


1. 


with Constant Coeficients 


(a) Yes. It comes from Theorem 2.32. 
(b) Yes. It comes from Theorem 2.28. 


(c) No. The equation y = 0 has the auxiliary polynomial p(t) = 1. But 
y = 1 is not a solution. 


(d) No. The function y = e' + e~' is a solution to the linear differential 
equation y” — y = 0. 


(e) Yes. The differential operator is linear. 


(f) No. The differential equation y” — 2y’ + y = 0 has a solution space of 
dimension two. So {e'} could not be a basis. 


(g) Yes. Just pick the differential equation p(D)(y) = 0. 


(a) No. Let W be a finite-dimensional subspace generated by the func- 
tion y =t. Thus y is a solution to the trivial equation Oy = 0. But 
the solution space is C° but not W. Since y“*) = 0 for k > 2 and it 
is impossible that 

ay’ + by=a+ bt =0 


for nonzero a, W cannot be the solution space of a homogeneous 
linear differential equation with constant coefficients. 


(b) No. By the previous argument, the solution subspace containing y = t 
must be C™. 


(c) Yes. If x is a solution to the homogeneous linear differential equation 
with constant coefficients whose is auxiliary polynomial p(t), then 
we can compute that p(D)(2’) = D(p(D)(«)) = 0. 


(d) Yes. Compute that 


p(D)q(D)(a + y) = (D)p(D)() + p(P)q(P)(x) = 0. 


t 


(ec) No. For example, e‘ is a solution for y’ - y = 0 and e™ is a solution 
t,t 


for y’ + y =0, but 1 = e’e™ is not a solution for y” — y = 0. 


3. Use Theorem 2.34. 


(a) The basis is {e~', te*}. 
(b) The basis is {1,e’,e7'}. 
(c) The basis is {e’, te’,e*, te}. 
(d) The basis is {e~', te}. 


(ce) The basis is {e~*, e®, e**}, where a is the complex value 1 + 2%. 


4. Use Theorem 2.34. 
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10. 


(a) The basis is {e,e°}, where a = Liv and 6 = Ely 
(b) The basis is {e*, te’, t?e"}. 
(c) The basis is {1,e77%,e~**}. 


. If f and g are elements in C™, then we know that the k-th derivative of 


f +g exists for all integer & since 
(f + a) = i + g™ 


So f +g is also an element in C®. Similarly, for any scalar c, the k-th 
derivative of cf exists for all integer k since (cf)() = cf. Finally, the 
function f = 0 is an element in C™ naturally. 


(a) Use the fact 
D(f + eg) = D(f) + eD(g) 
for functions f,g « C® and scalar c. This fact is a easy property 
given in the Calculus course. 


(b) If p(t) is a polynomial, then the differential operator p(D) is linear 
by Theorem E.3. 


. Let W and V be the two subspaces generated by the two sets {x,y} and 


{$(a+y), £(a- y)} separately. We know that W > V since $(a+y) and 
+(x -y) are elements in W. And it is also true that Wc V since 


1 i 
=—(r@+y)+—(2- 
r 5 (2 y) 54 y) 


and 


y=5(e+y)- gE (e-9) 


are elements in V. 


. Compute that et)! = ee’ = (cosbt + isinbt)e™. By Theorem 2.34 


and the previous exercise we get the result. 


. Since those U; are pairwise commutative, we may just assume that 7 = n. 


Hence if U,,(a) =0 for some x € V, then 


U,Ug:--Un (2) = U,U2:--Un-1(0) =0. 


Use induction on the number n of distinct scalar c;’s. When n = 1, the set 
{e™"} is independent since e! is not identically zero. Suppose now the 
set fe, e",...,e°'} is independent for all n < k and for distinct c¢;’s. 
Assume that 


k 
> ber! = 0. 
i=1 


11. 


12. 


Since any differential operator is linear, we have 
k k-1 
0 = (D 4 crl)(>> bye") = Py (c; i cp) bye" 
i=l i=1 


This means that (c; — cz,)b; = 0 and so b; = 0 for all i < k by the fact that 
c;’s are all distinct. Finally b; is also zero since 


bpec** =0. 


Denote the given set in Theorem 2.34 to be S. All the element in the set 
S is a solution by the proof of the Lemma before Theorem 2.34. Next, 
we prove that S' is linearly independent by induction on the number k 
of distinct zeroes. For the case k = 1, it has been proven by the Lemma 
before Theorem 2.34. Suppose now the set S is linearly independent for 
the case k <m. Assume that 


ee 


oS bg tec =0 


for some coefficient b;,;. Observe that 
(D - emI) (te) = jt?" e%** + (eG; — Cm). 


Since any differential operator is linear, we have 


mni-l1 : 
(D = Gil OOry a bj jt2e“*) =0. 


i=1 j=0 


Since all terms fo 7 = m are vanished by the differential operator, we may 
apply the induction hypothesis and know the coefficients for all terms in 
the left and side is zero. Observer that the coefficient of the term t™~!e%* 
is (Cj; — Cm)" bi.n,-1- This means (c¢; - Gm)" bin,-1 = 0 and so 6j,n,-1 = 0 
for all i < m. Thus we know that the coefficient of the term t”‘~*e%® is 
(Ci — Cm)”"™ Din;-2- Hence b;n,-2 = 0 for all i < m. Doing this inductively, 
we get b;,; = 0 for alli <m. Finally, the equality 


Nm-l1 : 
» bag ee =0 
implies bm,; = 0 for all 7 by the Lemma before Theorem 2.34. Thus we 


complete the proof. 


The second equality is the definition of range. To prove the first equality, 
we observe that R(g(Dy)) ¢ N(A(D)) since 


h(D)(g(D)(V)) = p(D)(V) = {0}. 
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14. 


15. 


16. 


Next observe that 
N(g(Dv)) = N(g(P)) 


since N(g(D)) is a subspace in V. By Theorem 2.32, the dimension of 
N(g(Dy)) = N(g(D)) is the degree of g. So the dimension of R(g(Dy)) 
is the degree of h(t) minus the degree of g(t), that is the degree of h(t). 
So N(h(D)) and R(g(Dv)) have the same dimension. Hence they are 
the same. 


(a) The equation could be rewriten as p(D)(y) = x, where p(t) is the 
auxiliary polynomial of the equation. Since D is surjective by the 
Lemma 1 after Theorem 2.32, the differential operator p(D) is also 
surjective. Hence we may find some solution yo such that p(D)(yo) = 
x. 


(b) Use the same notation in the previous question. We already know 
that p(D)(z) = a. If w is also a solution such that p(D)(w) = 2, then 
we have 


p(D)(w- 2) = p(D)(w) - p(D)(z) =e - x =0. 


So all the solution must be of the form z+y for some y in the solution 
space V for the homogeneous linear equation. 


We use induction on the order n of the equation. Let p(t) be the auxiliary 
polynomial of the equation. If now p(t) = t-c for some coeficient c, then 
the solution is Ce for some constant C by Theorem 2.34. So if Ce“ = 0 
for some to € R, then we know that C = 0 and the solution is the zero 
function. Suppose the statement is true for n < k. Now assume the degree 
of p(t) is k. Let a be a solution and to is a real number. For an arbitrary 
scalar c, we factor p(t) = q(t)(t-c) for a polynomial q(t) of degree k - 1 
and set z = q(D)(x). We have (D-clI)(z) = 0 since x is a solution and 
2(to) = 0 since « (tg) = 0 for all0< i<n-1. Again, z must be of the form 
Ce. And so Ce = 0 implies C = 0. Thus z is the zero function. Now 
we have q(D)(a) = z =0. By induction hypothesis, we get the conclusion 
that x is identically zero. This complete the proof. 


(a) The mapping ® is linear since the differential operator D is linear. If 
®(x) = 0, then « is the zero function by the previouse exercise. Hence 
® is injective. And the solution space is an n-dimensional space by 
Theorem 2.32. So The mapping is an isomorphism. 


(b) This comes from the fact the transformation ® defined in the previous 
question is an isomorphism. 


(a) Use Theorem 2.34. The auxiliary polynomial is t? + £. Hence the 


1 
basis of the solution space is 


{eit F eit Fy 
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or 


{cos nf, sin nf 


by Exercise 2.7.8. So the solution should be of the form 


A(t) = Ci costy/2 +O sinty/4 


for some constants C; and Co. 


(b) Assume that 
O(t) =Cy costy/2 +Cy sinty/4 


for some constants C and C2 by the previous argument. Consider 
the two initial conditions 


9(0) = on /fs 265 


and 
fi S gs... 
6'(0) = cny/4 =0. 

J 

Thus we get 
Ci= a)! 

g 

and 


C2 = 0. 


So we get the unique solution 


a0) = Oy /Leosey/F, 
g l 


(c) The period of cos t/Z is any /t. Since the solution is unique by the 


previous argument, the pendulum also has the same period. 


17. The auxiliary polynomial is ¢? + -. So the general solution is 


[k Lk 
y(t) = Ci costy / — + Cosinty / — 
m m 


for some constants C; and C2 by Exercise 2.7.8. 


18. (a) The auxiliary polynomial is mt? + rt+k. The polynomial has two 


ZETOeS 
—r+Vr2—Amk 


2m 


a= 
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and 
r—-Vr2—-4mk 
2m : 


Ao 


So the general solution to the equation is 


y(t) = Cye™ + Coe”*. 
(b) By the previous argument assume the solution is 
y(t) = Cye™ + Coe”*. 
Consider the two initial conditions 


y(0) = C1 + C2 =0 


and 
y' (0) = aC, + BC = U9. 
Solve that 
CL a (a = B) v0 
and 
C2 a (8 = a)7*vo. 


(c) The limit tends to zero since the real parts of a and ( is both -5—, 
a negative value, by assuming the r? —-4mk < 0. Even if r?-4mk > 0, 
we still know that a and £ are negative real number. 


19. Since F(R, R) is a subset of F(C,C), so if the solution which is useful in 
describing physical motion, then it will still be a real-valued function. 


20. (a) Assume the differential equation has monic auxiliary polynomial p(t) 
of degree n. Thus we know that p(D)() = 0 if # is a solution. This 
means that a) exists for all integer k <n. We may write p(t) as 
t” + q(t), where q(t) = p(t) —t” is a polynomial of degree less than n. 
Thus we have 

x") = -q(D)(z) 


is differentiable since x”) is a linear combination of lower order terms 
«c) with k <n-1. Doing this inductively, we know actualy « is an 
element in C™. 

(b) For complex number c and d, we may write c = c,+icg and d = d, +id2 
for some real numbers cj, c2, di, and dz. Thus we have 


etd = elertdi)+i(ertda) — 601 641 (cos(cy + dz) + isin(c2 + d2)) 


and 


d 


eve = eCet (cosce + isincg)(cos dz + isind2) 


= efe4[ (cos cz cos dz — sin cz sind.) + i(sin cz cos dz + cos c2 sind2) | 
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=e et (cos(c2 + dz) Cs isin(c2 a dz)). 


d d 


This means e“® = e%e® even if c and d are complex number|'| For the 


second equality, we have 


So we get 


Let V be the set of all solution to the homogeneous linear differential 
equation with constant coefficient with auxiliary polynomial p(t). 
Since each solution is an element in C®, we know that V 2 N(p(D)), 
where N(p(D)) is the null space of p(D), since p(D)(a) = 0 means 
that x is a solution. Conversely, if x is a solution, then we have 
p(D)(x) =0 and so x € N(p(D)). 


Let c = cy + ic2 for some real numbers c; and co. Directly compute 
that 
(e*)’ = (aiurieee = (e°* (cos cat + isin cgt))! 


c1e°'' (cos cat + isin cgt)) + icge™!" (cos cot + isin cot) 
(cy + icg)e%" (cos cgt + isin cat) = ce. 


Assume that x = 271+ ix2 and y = y, + tye for some £1, Xo, y1, and yo 
in F(R,R). Compute that 


(xy)! = (a1y1 — r2y2)' + (ays + toy)’ 
= (ayy + ry} — gyo- e2yg) + U(x ye + eiyy + xQy1 + Ly}) 
= (x1 + 209) (yi + ty) + (a1 + tx2)(y) + ty) = 2"y t zy’. 
Assume that x = 71 + 1x2 for some x; and x2 in F(R,R). If 
z=, +ix, =0, 


then x; = 0 and 2 = 0 since xj and 24 are real-valued functions. 
Hence x; and x2 are constant in R. Hence z is a constant in C. 


1The textbook has a typo that ect? = c°e?. 
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Chapter 3 


Elementary Matrix 
Operations and Systems of 
Linear Equations 


3.1 Elementary Matrix Operations and Elemen- 


1. 


tary Matrices 


(a) Yes. Since every elementary matrix comes from [,,, a square matrix. 
(b) No. For example, 2/; is an elementary matrix of type 2. 

) Yes. It’s an elementary matrix of type 2 with scalar 1. 

) 


(c 


(d) No. For example, the product of two elementary matrices 


(S08) (2 3) 


is not an elementary matrix. 
(e) Yes. This is Theorem 3.2. 


(f) No. For example, the sum of two elementary matrices 


(oi )(i o}-( 4) 


is not an elementary matrix. 
(g) Yes. See Exercise 3.1.5. 
1 0 1 0 
(h) No. For example, let A = 0 0 and B = 1or Then we can 
obtain B by add one time the first row of A to the second row of 
B. But all column operation on A can not change the fact that the 


second row of A is two zeros. 
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(i) Yes. If B= EA, we have E~'B = A and E! is an elementary matrix 
of row operation. 


2. By adding -2 times the first column of A to the second column, we obtain 
B. By adding -1 time the first row of B to the second row, we obtain C. 


1 0 0 1 0 0 100 
Fn B= (1 -4 r} eae 0 1 roe if "} 
a a Si. (Oi 03 1 
f.0s 3 1: 
Ex,=| 0 1 O J, B5=] 0 1 -I1 |. We have that 
0 0 00 1 
Es E4E3E2E,C = Is. 


The following is the process. 


3. (a) This matrix interchanges the first and the third row. So the inverse 


matrix do the inverse step. So the inverse matrix do the same thing 


and it is 
001 
0 1 Of. 
1 0 0 


(b) This matrix multiplies the second row by 3. To the inverse matrix 
multiplies the second row by 5 and it is 


10 0 
O20 fe 
001 


(c) This matrix adds -2 times the first row to the third row. So the 
invers matrix adds 2 times the first row to the third row and it is 


1 0 0 
0 1 0}. 
2 0 1 
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. A matrix who interchanges the i-th and the j-th rows is also a matrix who 
interchanges the i-th and the j-th columns. A matrix who multiplies the 
i-th row by scalar c is also a matrix who multiplies the 7-th column by 
scalar c. A matrix who adds c times the i-th row to the j-th row is also a 
matrix who adds c times the j-th column to the 7-th column. 


. We can check that matrices of type 1 or type 2 are symmetric. And the 
transpose of a matrix of type 3, who adds c times the i-th row(column) 
to the j-th row(column), is a matrix of type 3, who adds c times the j-th 
row(column) to the i-th row(column). 


. If B can be obtained from A by an elementary row operation, we could 
write B = EA. So we have B‘ = A‘ EF! and this means B can be obtained by 
A by elementary column operation with corresponding elementary matrix 
E*. If B can be obtained from A by an elementary column operation, 
we could write B = AE. So we have B' = E' A’ and this means B can be 
obtained by A by elementary row operation with corresponding elementary 
matrix E°. 


. It’s enough to check the following matrix multiplication is right. Let 
{u1,U2,.-.,Un} and {v1,v2,-..,Un} be the row and column vectors of A 
respectly. 


For row operations: 


i-th 1 -— Uy 
A= 
j-th 1 Sy eS 
1 . 
i-th c A= CU; 
1 
i-th 1 
A= 
j-th Cc 1 — cu; + UZ - 
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For column operations: 


a J 
-th —th 
1 | 
A 7 Vv; 
1 | 
1 
—th 
1 | 
A Cc = CU; 
1 | 
a J 
—th —th 
1 c 
A = 
1 


8. By Theorem 3.2 E~! is an elementary matrix of the same type if E is. So 
if Q can be obtained from P, we can write Q = EP and hence E~!Q = P. 


This means P can be obtained from Q. 


9. The operation of interchanging the i-th and the j-th row can be obtained 


by the following steps: 
e multiplying the 7-th row by -1,; 
e adding —1 time the i-th row to the j-th row; 
e adding 1 time the j-th row to the i-th row; 
e adding —1 time the i-th row to the j-th row. 


10. The operation of multiplying one row by a scalar c means dividing the 


same row by a scalar 4. 


11. The operation of adding c times of the i-th row to the j-th row means 
substracting c times of the i-th row to the j-th row. 


12. Assuming k = min{m,n}. Set 7 be a integer variable and do repeatly the 


following process: 
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e If Aj; =0 for all 7, take ¢=7+1 and omit following steps and repeat 
process directly. 


e If A;; +0 for some j, interchange the i-th and the j-th row. 


e Adding — fii times the i-th row to the j-th row for all 7 >i. 


e Set 7=72+1 and repeat the process. 


3.2. The Rank of a Matrix and Matrix Inverses 


1. (a) No. For example the rank of a 2x2 matrix with all entries 1 has only 
rank 1. 


(b) No. We have the example that the product of two nonzero matrices 
could be a zero matrix. 


) 
) Yes. This is Theorem 3.4. 

) No. They do. 

) Yes. This is the Corollary 2 after the Theorem 3.6. 

) Yes. This is the argument after the definition of augmented matrix. 
) Yes. Rank of an m xn matrix must be less than m and n. 

(i) Yes. This means Ly, is a surjective transformation from F” to F” 


and hence a injective transformation, where A is the matrix. 


2. In the following questions we may do the Gaussian elimination and the 
number of nonzero row vectors is equal to the rank of that matrix. 


(a) The rank is 2. 


HK 


Oo 


(b) The rank is 3. 


(c) The rank is 2. 


1 0 2 Ls 20e 2 

1 1 4 O 1 2 
(d) The rank is 1. 

1 2 2 a: 12 1 

2 4 2 0 0 0 
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(e) The rank is 3. 


ie eee a | i 3° oe: at 
14 012] [0 2 -3 0 1 
Oo Serpe d O08 2h 0. OF 
by 20: OY 6 G0" 28 a St 
dc D> Bt ie 
~|9 2-3 0 1 
0. Os Or 206 HO 
0 O26." 0 
(f) The rank is 3. 
Ts. 222 O- -a 7 tf oO a 
OA A Be 31) OO.) A 9 
Sok Ba PE BO ere 
= ae ae 0011 5 
OOo. i. 20 a a 
OMe i. Ty 0° 0° Pf He 
Oh GOs 0° No eae BS 
0000 7 0000 0 
(g) The rank is 1. 
fae) Pam ce ay. 1. Os, a 
2 Bes 000 0 
sae Orme ieee (el Pca 8 he I) 
i? Sie-202. 000 0 


3. It’s natural that rank(A) = 0 if A = 0. For the converse, we know that 
if A is not a zero matrix, we have A;; + 0 and thus the i-th row is an 
independent set. So rank(A) can not be zero. 


4. Just do row and column operations. 


(a) The rank is 2. 
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(b) The rank is 2. 


2 1 2 1 1 0 
-1 2 ]~] 0 3 ~| 0 1 
2 1 0 0 0 0 


5. For these problems, we can do the Gaussian elimination on the augment 
matrix. If the matrix is full rank"] then we get the inverse matrix in the 
augmenting part. 


(a) The rank is 2 and its inverse is ( > } 


1 O/};-1 2 1 O;-1 2 
“\o -1/-1 1/740 1/1 -1 
(b) The rank is 1. So there’s no inverse matrix. 


(c) The rank is 2. So there’s no inverse matrix. 


1 
== 3 =l 
2 

(d) The rank is 3 and its inverse is| 3-4 2 
1 —-2 1 

1 _1i ol 

3 2 
(e) The rank is 3 and its inverse is i 0 -+ 
-i 2 l 

6 3 2 


(f) The rank is 2. So there’s no inverse matrix. 
-51 15 7 12 


tote ices | ok. SP =k =F 
(g) The rank is 4 and its inverse is 103 1 2 
= 1 1 1 


(h) The rank is 3. So there’s no inverse matrix. 


6. For these problems we can write down the matrix representation of the 
transformation [T]®, where a = {uj,u2,...,Un} and B = {v1,v2,...,Un} 
are the (standard) basis for the domain and the codomain of T. And the 
inverse of this matrix would be B = [T~']3. So T~* would be the linear 
transformation such that 


T *(v;) = om Biju. 
w=1 


1This means rank(A) =n, where A is an nx n matrix. 
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(a) We get [T]? = | 


know that 


-1 


0 
0 


-1 -2 
B= 0 -1 
0 0 


-10 
-4 }|. So we 


=, 


T(a+ ba + cx”) = (-a - 2b- 10c) + (-b - 4c) x + (-c) x. 


0 1 0 
(b) We get [T]% = f 1 ) a matrix not invertible. 
1 0 1 


invertible. 


(c) We get [T]é = | 


know that 


T (a, b,c) (a 


1 
(ce) We get [T]® = f 
1 


(f) We get [T]8 = 


So T is not 
1 iz, 
i 24 ao aa 
-1 1 2] and [T*]g =|5 0 -]. So we 
iy “a? 
1 01 “6 3 8 
1 1 1 1 1 
got 5G 5% 5o at b+ 50). 
1 1 0 0 1 
-1 1] and [T-"]g = ? -+ 0 J. So we know 
0 0 3 3 -l 
1 1,1 1 
T(a, b,c) = (c, 50 5) att b-c) 
-1 1 0 1 0 
O O] and [T*]3 7 -5 0 2 . So we know 
i 
11 oe ie 
. 11 
T(a+ bu + cx*) = (b,-=a+ 50 5% b+=c) 
00 1 
00 1 : : i ; 
1102 matrix not invertible. So T is not 
1 1 0 


1 
1 
Fs 0 
0 


invertible. 


7. We can do the Gaussian elimination and record what operation we’ve 


done. 


Wo, 2s 
1 031|~ 
1 1 2 


2 
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1 
0 
1 


2 


1 
-2 0 
2 


8. 


9. 


10. 


11. 


1 “O-1 ie ed 
~{ 0 1 O0]~+{] 0 1 0 
Orel a 001 
100 
~{ 0 1 0 
001 
1 0 0 1). 16°30 I * 30:0) 1 -2 0 
Let FE, =]-1 1 O|, f=] 0 1 0}. 2-(1 -4 ije-(0 1 0 
0 01 -1 01 0 0 1 0 0 1 
100 Te J) - 3 
Es=|0 1 O|,£e={0 1 O|. 
Ok 4 00 1 


Thus we have the matrix equals to Ej! Ez'E3!Ez' Es‘ Eg}. 
It’s enough to show that R(L4) = R(L.a. But this is easy since 


R(La) a La(F”) = cL a(F™) = Lea(F”) a R(L,A). 


If B is obtained from a matrix A by an elementary column operation, then 
there exists an elementary matrix such that B= AE. By Theorem 3.2, E 
is invertible, and hence rank(B) =rank(A) by Theorem 3.4. 


Let A be an mx 1 matrix. Let 7 be the smallest integer such that A; # 0. 
Now we can interchange, if it’s necessary, the first and the i-th row. Next 
we can multiply the first row by scalar An and we get A;; = 1 now. Finally 
add —A,x1 times the first row to the k-th row. This finished the process. 


We may write Bo = . And thus we have that 


Let L =span{] . ]} be a subspace in F™*'. So we have that R(Lg) = 


0 
[+ R(Bo). And it’s easy to observe that all element in L has its first 
entry nonzero except 0. But all element in R( BG) has it first entry zero. 
So we know that Ln R(B6) = {0} and hence R(Lg) = L® R(Bo). By 
Exercise 1.6.29(b) we know that dim(R(Lg)) =dim(L)+dim(R(Bo)) = 
1+dim(R(B5)). 
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12: 


13. 


14. 


Next we want to prove that dim(R(B9)) =dim(B’) by showing N(Lp; ) = 
N(Lep-). We may let 


and scrutinize the fact 


Um 
is true. So N(Lg;) = N(Lp’) is an easy result of above equalitiies. Finally 


since Lp and L’, has the same domain, by Dimension Theorem we get 
the desired conclusion 


rank(B) = dim(R(Lg)) = 1+ dim(R(Bo)) 
= 1+dim(Zp-) = 1+ rank(B’). 


If B’ can be transformed into D’ by an elementary row operation, we could 
write D’ = EB’ by some elementary matrix FE. Let 


be an larger matrix. Then we have D = E’B and hence D can be obtained 
from B by an elementary row operation. For the version of column, we 
have D' = B’E. And then get the matrix E’ by the same way. Finally we 
have D= BE’. 


(b) By Theorem 3.5 and the Corollary 2 after Theorem 3.6 we have that 
the maximum number of linearly independent rows of A is the maxi- 


mum number of linearly independent columns of A‘ and hence equals 
to rank(A‘) =rank(A). 


(c) This is an instant result of (b) and Theorem 3.5. 


(a) For all ye R(T’ +U) we can express it as y = T(x) + U(x) € R(T) + 
R(U) for some re V. 


(b) By Theorem 1.6.29(a) we have 
rank(T + U) < dim(R(T) + R(U)) 


= dim(R(T)) + dim(R(U)) - dim( R(T) 9 R(UV)) 
< rank(T) + rank. 
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(c) We have that 
rank(A + B) = rank(L4+z) =rank(Ly + Lp) 
< rank(A) + rank(B). 


15. Let P = M(A|B) and Q =(MA|MB). We want to show that P;; = Q;,; for 


16. 


17. 


18. 


all ¢ and for all 7. Assume that A and B has a and 6 columns respectly. 
For j = 1,2,...,a, we have that 


Pij = > Mir Ary = (MA)iz = Qi;- 
k=1 
For j =a+1,a+2,...,a+b, we have that 
Piz =) Mix Bry = (MB)az = Quy. 
k=1 


Since P is invertible, we know that Lp is an isomorphism. So by Exercise 
2.4.17 we have that 


rank( PA) = dim(P(A(E"))) 
= dim(A(F")) = A(E")(A). 


by 
Let B= 8 and C= (c1 C2 c3). Thus we know that 


bg 
bic, bic by c3 
BC = bc, bce bac3 
b3c1 b3c2 b3¢3 


has only at most one independent rows. So the rank of BC’ is at most one. 


Conversely, if the rank of A is zero, we know that A = O and we can pick B 
and C such that they are all zero matrices. So assume that the rank of A 
is 1 and we have that the i-th row of A forms an maximal independent set 
itself. This means that we can obtained the other row of A by multiplying 
some scalar (including 0), say 6; for the j-the row. Then we can pick 


by il 
B= : and C= i: . Thus we get the desired matrices. 
b3 Ais 


Let A; be the matrix consists of the i-th column of A. Let B; be the 
matrix consists of the i-th row of B. It can be check that actually 


AB = A.B; 
a=1 


and A;B; has rank at most 1. 
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19. It would be m. Since the range of A is a subspace with dimension m in 
F™, we know that L4(F”) = F”. Similarly we also have that Dp (F”) =F". 
So we know that 
LaB(F?) = La(Lp(F?)) =F” 


and hence the rank of AB is m. 


20. (a) Just like the skill we learned in the Exercise 1.4.2 we can solve the 
system of linear equation Ax = 0 and get the solution space 


{(x3 + 305, -223 + 25,073, —2x5,25) TUE F}. 


And we know that {(1,-2,1,0,0),(3,1,0,-2,1)} is a basis for the 
solution space. Now we can construct the desired matrix 


1 3 00 0 
-2 1 0 0 0 
M=]1 0 0 0 0 
0 -2 0 0 0 
0 1 0 0 0 


(b) If AB = O, this means that every column vector of B is a solution 
of Ax =0. If rank of B is greater than 2, we can find at least three 
independent vectors from columns of B. But this is is impossible 
since by Dimension Theorem we know that 


dim(F’) = dim(R(L,)) + dim(N(La)) 
and so dim(N(LZ,)) =5-3=2. 


21. Let 6 = {e1,e2,...,@m} be the standard basis for F™. Since the rank of A 
is m, we know that L4 is surjective. So we can find some vector v; € F” 
such that La(v;) = e;. So let B be the matrix with column vector 1. 
Thus B is ann xm matrix and AB=T since Av; = e;. 


22. We know that B® is an mxn matrix with rank n. By the previous exercise 
we have some n x m matrix C such that B*C = I;,. We may pick A= C*. 
Now we have the fact that AB = C'B = (B‘C)! = (Im)' - Im. 


3.3 Systems of Linear Equation—Theoretical As- 
pects 


1. (a) No. The system that Oz = 1 has no solution. 
(b) 

(c) Yes. It has the zero solution. 

(d) No. The system that 0x = 0 has no solution. 


No. The system that Ox = 0 has lots of solutions. 
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(e) No. The system that 0x = 0 has lots of solutions. 


(f) No. The system Oz = 1 has no solution but the homogeneous system 
corresponding to it has lots of solution. 


(g) Yes. If Ax = 0 then we know x = A7'0=0. 
(h) No. The system x = 1 has solution set {1}. 


2. See Example 2 in this section. 


(a) The set {(-3,1)} is a basis and the dimension is 1. 

(b) The set 1G, a 1)} is a basis and the dimension is 1. 

(c) The set {(-1,1,1)} is a basis and the dimension is 1. 

(d) The set {(0,1,1)} is a basis and the dimension is 1. 

(e) The set {(—2,1,0,0), (3,0, 1,0), (-1, 0,0, 1)} is a basis and the dimen- 
sion is 3. 

(f) The set {(0,0)} is a basis and the dimension is 0. 

(g) The set {(-3,1,1,0),(1,-1,0,1)} is a basis and the dimension is 1. 


3. See Example 3 in this section. 


(a) The solution set is (5,0) + span({(-3, 1)}). 

(b) The solution set is (2, 5,0) + span({(4, 5, 1)}). 

(c) The solution set is (3,0,0) + span({(-1,1,1)}). 

(d) The solution set is (2,1,0) +span({(0,1,1)}). 

(e) The solution set is (1,0, 0,0)+span({(-2, 1,0,0), (3,0, 1,0), (—1,0,0,1)}). 
(f) The solution set is (1,2) + span({(0,0)}). 

(g) The solution set is (-1,1,0,0) + span({(-3, 1, 1,0), (1,-1,0,1)}). 


4. With the technique used before we can calculate A™! first and then the 
solution of Az = b would be Av‘b if A is invertible. 


-5 


(a) Calculate A7! = ( 9 


i and solution is 7; = —-11, x2 = 5. 


al 


2 
t 
9 


i 
(b) Calculate A~t = | i and solution is x; = 3, v2 =0, #3 = -2. 


wilrwlrR © 


5. Let A be the nx n zero matrix. The system Az = 0 has infinitely many 
solutions. 


6. If T(a, b,c) = (a+ b, 2a—c) = (1,11), then we get a+b=1 and 2a-c=11. 
This means the preimage set would be 


T *(1,11) = {(a,1-4,2a-11): ae R}. 
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10. 


11. 


12: 


13. 


14. 


. See Theorem 3.11 and Example 5 of this section. 


(a) It has no solution. 
(b 


It has a solution. 


c) It has a solution. 


( 
(d 


It has a solution. 


e) It has no solution. 


( 
(a) Just solve that a+ b= 1, b- 2c = 3, a+ 2c = -2 and get a=0, b=1, 
c=-1 is a solutoin. So we know ve R(T). 

(b) Just solve that a+b = 2, b-2c=1, a+2c=1 and get a=1, b= 1, 


c=0 is a solution. So we know ve R(T). 
. This is the definition of D4 and R(La). 


The answer is Yes. Say the matrix is A. Since the matrix has rank m, we 
have that dimension of R(L4) ism. But this means R(L,4) =F” since the 
codomain of Ly, is F” and it has dimension m. So it must has a solution 
by the previous exercise. 


Solve the system Ax = x and we can get x = (4, 2, +). And the amount 
of each entry is the ratio of farmer, trailor, and carpenter respectly. 


0.6 0.3 
we fe a 
be the input-output matrix of this system. We want to solve that Az = x, 


which means (A - J) = 0. By calculation we get x = t(3,4) for arbitrary 
t¢R. So the proportion is used in the production of goods would be e 


Set 


In this model we should solve the equation (J - A)a = d. And we can 
compute that J — A is invertible and 


a= (I-A) ‘d= (1). 


dol ao|o 


(I-A) += ( 


So we know 


0.50 0.20 


The input-output matrix A should be (isc 0.60 


) And the demand 


20)" 
get the answer x = (200. 250). Thus x is the support vector. 


vector d should be (23) So we can solve the equation (I —- A)x = d and 
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3.4 Systems of Linear Equations—Computational 
Aspects 


1. (a) No. This form could only fit for row operation. For example, x = 1 
and x = 0 has different solution set. 


(b) Yes. This is Theorem 3.13. 

(c) Yes. This is the result of Theorem 3.16. 
(d) Yes. This is Theorem 3.14. 

) 


(e) No. For example, the system with corresponding augmented matrix 
( 0] 1 ) has no solution. 


Yes. This is Theorem 3.15. 
Yes. This is Theorem 3.16. 


nonzero rows in A’ is the rank of A. And the number of nonzero rows 
in (A’|b’) is the rank of (Alb). So if they have different rank there 
must contain some nonzero rows (actually only one row) in (A’|b’) 
but not in A’. This means the nonzero row must has nonzero entry in 
the last column. Conversely, if some row has its only nonzero entry 
in the last column, this row did not attribute the rank of A’. Since 
every nonzero row in A’ has its corresponding row in (A’|b’) also a 
nonzero row, we know that two matrix have different rank. 


(b) By the previous exercise we know that (A'‘|b’) contains a row with 
only nonzero entry in the last column is equivalent to that A’ and 
(A’b’) have different rank. With the help of Theorem 3.11 we get 
the desired conclusion. 
4. (a) The solution set is {(4, ,0,0)+t(1,-1,1,2)}. The basis is {(1,-1, 1, 2)}. 
(b) The solution set is {(1,0,1,0) + s(-1,1,0,0)+¢(1,0,1,2)}. The basis 
is {(-1,1,0,0), (1,0, 1,2)}. 
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(c) It has no solution. 


5. Let R be the matrix in reduced echelon form. We know that there is an 
invertible matrix C such that CA = R. This means 


So we get 


And hence 


LY Ge Po ug. at 
A=COCR S(t Hi 9B) -S25-27 1, 
S-. eae OY. 39 


6. Let R be the matrix in reduced echelon form. We know that there is an 
invertible matrix C such that CA = R. But now we cannot determine 
what C is by the given conditions. However we know that the second 
column of R is —3 times the first column of R. This means 


3 3 
1 1 
0=RIO0]=CAIO]. 
0 0 
0 0 
Since C’ is invertible, we know that 
A =0. 


And this means the second column of A is also —3 times the first column 
of A. And so the second column of A is (—3,6,3,-9). Similarly we have 
that 


-4 —5 
0 -2 
-3 0 
A 1 =0=A 0 
0 1 
0 1 
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and get the answer that matrix A is 


1 -3 -1 1 O38 
—2 6 1 -5 1 -9 
-l 3 2 2 -3 2 
3-9 -4 0 2 = 5 


7. See Exercise 1.6.8. Note that if we put those vector as row vectors of 
just like what we’ve done in the Exercise 1.6.8, we cannot interchange any 
two rows. However we can also construct a matrix the i-th column u;. 
And we can do the row operation including interchanging any two rows. 
The set of columns containing one pivot? forms an independent set. 


2 1 -8 1 -3 
-3 4 12 37 -5 


1 1 3 
~10 1 0 7 -8 
0000 1 


So the set {u1,u2,us} is a basis for R®. 
8. Do the same just like what we’ve done in the previous exercise. 


2 -6 3 2 -l1 O 1 2 
-3 9 -2 -8 1 -3 O -l 
4 -12 7 2 2 -18 -2 1 
-5 15 -9 -2 1 9 3. (9 
2 -6 1 6 -3 12 -2 7 


3 1 1 
Fo a pat ce 
~lo 0 0 0 1 -4 -%8 —t9 
0 0 0 0 0 0 +1 =-!1 
0 0 0 0 0 0 0 +20 


We know that {u1,u3,Us,u7} is a basis for W. 


9. Use the representation of those matrix with some basis (usually take the 
standard basis) and do the same thing like previous questions on them. 


O 12 1 =-1 
=. 2 1. =2 2 
=1 2 1 =2 2 
1 3 9 4 -1 


?The position who is the first nonzero entry in one nonzero row is called a pivot. For 
example, the position 11, 22, 35 are pivots. 
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i) So 2h 2 S59 
ge ag Ce Gp 
PKG 0 — Br 0 
00 0 0 0 


So we know that the subset containing the first, the second, and the fourth 
matrix forms a basis for W. 


10. (a) 
(b) 
11. (a) 
(b) 


It’s easy to check that 


0-2+3-1-0+0=0. 


So the vector (0,1,1,1,0) is an element in V. Since the set contains 
only one nonzero vector, it’s linearly independent. 


As usual we can find a basis 
p= {(2;.1;0,0,0),. (=3,0,1;0;.0); (1,0, 0;.1, 0), (20,0, 0,1) }: 


So we know that {(0,1,1,1,0)}U@ can generate the space V. Do the 
same thing to this new set and remember to put (0,1,1,1,0) on the 
first column in order to keep it as an element when we do Gaussian 
elimination. 


02 -3 1 -2 

1 1 0 0 0 

1 0 1 0 0 

1 0 0 1 0 

00 0 0 1 
1 1 0 0 0 
0 1 -1 0 0O 
+10 0 1 -1 0 
00 0 0 1 
00 0 0 0 


Now we know that 
B" = {(0, 1,1, 1,0), (2, 1,0, 0,0), (-3,0, 1, 0,0), (-2,0,0,0,1)} 
forms a basis for V. 


Similarly check 
1-4+3+0-0=0. 
So the set containing only the vector (1,2, 1,0,0) is linearly indepen- 
dent by the same reason. 
Do the same thing to the set {(1,2,1,0,0)} U8. 


LY 2? 3) A 2 
2 1 0 0 O 
1 0 1 0 0 
00 0 1 O 
00 0 0 1 
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1. Ae 1h O70 
0 1 -2 0 0 
~10 0 0 10 
O10: (07 10. 
00 0 00 


Now we know that the set 
{(1, 2, 1,0, 0), (2, 1, 0, 0, 0), (1, 0, 0, 1, 0), (-2, 0, 0, 0, 1)} 
forms a basis for V. 


12. (a 


wa 


Set v1 = (0,-1,0,1,1,0) and ve = (1,0,1,1,1,0). Check the two 
vectors satisfy the system of linear equation and so they are vectors 
in V. To show they are linearly independent, assume that 


a(0,—1,0,1,1,0) +6(1,0, 1,1, 1,0) 
= (b,-a, b,at+ b,a+ 6,0) =0. 


This means that a = 6 = 0 and the set is independent. 


(b) Similarly we find a basis 
B= {(1,1,1,0,0,0), (-1,1,0,1,0,0) 


ths -2, 0,0, 1,0), (S83, —2,0,0,0, 1)} 


for V as what we do in the Exercise 3.4.4. Still remember that we 
should put v1, and v2 on the first and the second column. 


Of 1 a-Si. a 8 
Gack th ALo 29 
OG. <P A afin BO 40 
1d Sek o> 10 
din 1 OP <0 - FG 
O00 0. 0. 4 
1101 0 0 
0: Ad, 30 0 
02-00: Sod 
“10 000 0 1 
0000 0 0 
0000 0 0 


So the set 
{(0, -1,0, 1, 1,0), (1,0, 1, 1, 1,0), eae 1,0, 1,0,0), (-3, ~2, 0,0, 0, 1)} 


forms a basis for V. 
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13. 


14. 


15. 


(a) Set vy = (1,0,1,1,1,0) and ve = (0, 2,1,1,0,0). Check the two vectors 
satisfy the system of linear equation and so they are vectors in V. 
To show they are linearly independent, assume that 


a(1,0,1,1,1,0) + 6(0,2,1,1,0,0) 


= (a, 2b,a+ b,a+b,a,0) =0. 
This means that a = b = 0 and the set is independent. 


(b) Take the same basis 6 as that in the previous exercise and do Gaus- 
sian elimination. 


ON Sh. ot, 8 
Ook D eo: 22 
A is i “Ay Oe 7 
10: a Oe 
1 50070 Oy. a . 
000 0 0 1 
100 0 1 0 
6. On ie 
O00 Pel hy 
TT. Oc 1. od 
000 0 0 0 
000 0 0 0 


So the set 
(150, 11,15 0),(0;2;.1,,1,0,0),(1,1,.1,0;0)0),(—3,=2,0,0,/0;1)} 
forms a basis for V. 


It’s enough to check that A satisfies the definition of reduced echelon form 
on the page 185. For the first condition, a nonzero row in A is a nonzero 
row in (Alb). So it will precede all the zero rows in (Alb). But there may 
be some zero rows in A who are nonzero rows in (Alb). This kind of rows 
will be behind those nonzero row by the third condition for (Alb). The 
second condition for (Alb) implies the second condition for A. The third 
condition for (Alb) implies the third condition for A. 


We call a column whose corresponding column in one fixed reduced echelon 
form contains a pivot(see Exercise 3.4.7) a pivotal column. Now we induct 
on the number of columns of a matrix. For matrix contains only one 
column wu ,, the reduced echelon form of it would be the column e, if uy 
is nonzero (and hence {u} is independent) and that of it would be the 
zero column if u; is zero (and hence {u,} is dependent). Suppose that 
the reduced echelon form of a matrix with k columns is unique. Now 
consider a matrix A with k+1 columns, say u1,u2,...,Uz+1. Let A’ be the 
matrix by deleting the final column of A. So we can write A = (A’|uz41). 
Say (R’|b) is a reduced echelon form of A. By the previous exercise we 
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know that R’ is a reduced echelon form of A’. And R’ is unique by our 
induction hypothesis. So the set of pivotal columns P’ in A’ is also unique. 
By Theorem 3.16(c) and Theorem 3.16(d) we know that the set P’ in A’ is 
a maximal independent set of {u1,u2,...,uz}. Now if P’U{uxz} is linearly 
independent, this means 


rank(A) = rank(A’) + 1, 


say the value is r. By Theorem 3.16(a) and Theorem 3.16(b), } is the only 
column who can be e, and so we know that b = e,. On the other hand, if 
P’ U{ux} linearly dependent. The vector uz cannot be a pivotal column 
of A since P’ U {uz} is the set of pivotal column of A and it must be 
linearly independent. Futhermore, u;, has an unique representation with 
respect to the set P’. By Theorem 3.16(d), the column vector d must be 
the representation of uz. By all cases we know that (R’|b) is also unique. 
And by induction we get the desired conclusion. 
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Chapter 4 


Determinants 


4.1 Determinants of Order 2 


1. (a) No. We have det(2/2) = 4 but not 2det(Z>). 
(b) Yes. Check that 


det 6 ve a | = (a,d — bic) + k(azd - bye) 


- a, by ag by 
= act (" 1) «ae ’) 


a b 
det (. + keg dy + kdy 


a b a b 
= det & a + kdet ( 


for every scalar k. 
(c) No. A is invertible if and only if det(A) # 0. 


and 
= (ad, = bc1) + k(ad2 = bez) 


(d) No. The value of the area cannot be negative but the value det (‘) 


could be. 
(e) Yes. See Exercise 4.1.12. 


2. Use the formula in Definition in page 200. 


(a) The determinant is 6 x 4—- (-3) x 2 = 30. 
(b) The determinant is -17. 

(c) The determinant is -8. 
3. (a) 


a) The determinant is —-10 + 152. 
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10. 


(b) The determinant is -8 + 29%. 
(c) The determinant is -24. 


. Compute | det (‘) |. 


. It’s directly from the fact 


aet(¢ i)ea da = -(ad - be) = ter 


. It’s directly from the fact 


aut? j) = ad ma =0 
a b 


. It’s directly from the fact 


ac a 
aee(} i) ed cb=ad be=aet(® 


. It’s directly from the fact 


a b 
ave(5 j) = ad = 0 = ad 


. Directly check that 


aet((? (6 {jean (ere 


a Oo 
c dj)- 


i): 


+ bh 
cf +dh 


= (aet+bg)(cf+dh)-(af+bh)(ce+dg) = ad(eh-fg) 


= det (: x det (; ) . 


a 


For brevit ite A= 
or brevity, we write (’ i 


for the corresponding classical adjoint. 
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A for some 2 x 2 matrix and C' = ( 


be(eh- fg) = (ad-be)(eh-fg) 


d 


-c 
—-b a 


(a) Directly check that 


and 


(b) Calculate that 


det(C’) = da - (-c)(-b) = ad - be = det(A). 


Qa Cc 


b d 


( d a at 
=¢ a 


(d) If A is invertible, we have that det(A) + 0 by Theorem 4.2. So we 
can write 


(c) Since the transpose matrix A‘ is ( ) the corresponding classical 


adjoint would be 


[det(A)] ‘CA = A[det(A)]'C =I 
and get the desired result. 


11. By property (ii) we have the fact 


(i Joo( 


Since by property (i) and (ii) we know 
1 1 1 0 0 1 
vea{t a}eo{t a}o(r a) 
1 0 1 0 0 1 0 1 
“alr o)ea{e dealt a)eals a) 
1 0 0 1 
(0 )6(1 9) 
0 1 1 0 
at 0) =-8(5 i) =-1 


by property (iii). Finally, by property (i) and (iii) we can deduce the 
general formula for 6 below. 


a b 1 0 0 1 
a(t t)=as (2 f)+05(2 3) 
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we get that 


= ad = be= det (” Hy 
c od 


12. A coordinate system {u = (a, b),v = (c,d)} is right-handed means u’-v > 0 
where the vector u’ = (-b,a) is obtained by rotating the vector u in a 


counterclockwise direction through an angle 5. With the fact u’-v = 


ad — bc = det (") we get the conclusion. 


4.2 Determinants of Order n 


1. (a) No. See Exercise 4.1.1(a). 

(b) Yes. This is Theorem 4.4. 

(c) Yes. This is the Corollary after Theorem 4.4. 
) 


(d) Yes. This is Theorem 4.5. 


(e) No. For example, the determinant of F ') is 2 but not det(Z) = 1. 


(f) No. We have that (; ') =1+#2det(Z) =2. 


2 1 


(g) No. For example, the determinant of identity matrix is 1. 
(h) Yes. See Exercise 4.2.23. 


2. Determinant is linear when we fixed all but one row. So we have that 
3a, 3a. 3a3 a a2 a3 
det 3b} 3b 3b3 = 3det 3b} 3b 3b3 
3c, 3c2 3¢3 3c, 3c 3¢3 


ay ag a3 a, aQ a3 
= 9det by bg bs = 27 det by bo bs : 


3c, 3C2 3¢3 C1 C2 C3 


Hene we know that k = 27. 


3. We can add -3 times of the third row the the second row without changing 
the value of determinant and do the same as the previous exercise and get 
the conclusion that k = 2 x 3x 7 = 42. 


4. See the following process. 
by + Cy bz + co b3 + ¢3 -(b1 +¢1) —(b2 + €2) —(b3 +¢3) 
det ay+Cy Agt+Cg a3g+C3]=—- det ay +c, ag+ C2 a3 + C3 
a, + by Ag + be a3 + bg a, + by az + be a3 +b 
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Ne) oo “NI aD 


10. 
11. 
12. 
13. 
14. 
15. 
16. 
Lh. 
18. 
19. 


ay a2 a3 a, a2 
=-2det]a, +c, agtco ag+c3|=-—2det]c. co 


ay + by agt be a3 t+ b3 by bo 


ay a2 a3 
= 2det by bg b3 


C1 C2 C3 


The first equality comes from adding one time the second row and one 
time the third row to the first column and the second equality comes from 
adding —1 time the first row to the second and the third row. Finally we 
interchange the second and the third row and multiply the determinant 


by -1. Hence k would be 2. 


. The determinant should be -12 by following processes. 


0 -3 -1 -3 -1 0 
-oaet(§ p) ace ( ) e2aet(5 3 


=0x9-1x6+2x (-3) =-12 


. The determinant should be -13. 
. The determinant should be —-12. 
. The determinant should be -13. 
. The determinant should be 22. 


The determinant should be 4+ 27. 
The determinant should be -3. 
The determinant should be 154. 
The determinant should be -8. 
The determinant should be -168. 
The determinant should be 0. 
The determinant should be 36. 
The determinant should be —49. 
The determinant should be 10. 


The determinant should be —28 — 7. 
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a3 
C3 


20. 
21. 
22: 
23. 


24. 


25. 


26. 


The determinant should be 17 — 3%. 
The determinant should be 95. 
The determinant should be —-100. 


Use induction on n, the size of the matrix. For n = 1, every 1 x 1 matrix is 
upper triangular and we have the fact det (a) =a. Assuming the statement 
of this exercise holds for n = k, consider any (n+1)x(n+1) upper triangular 
matrix A. We can expand A along the first row with the formula 


det(A) = > (-1)'7 Ai; det(Aj;). 


+ 
=1 


Jj 


And the matrix Aaj, j #1, contains one zero collumn and hence has rank 
less than n+1. By the Corollary after Theorem 4.6 those matrix has 
determinant 0. However, we have the matrix Ait is upper triangular and 
by induction hypothesis we have 


7 ntl 
det(A11) = ig Ajj. 
i=2 
So we know the original formula would be 
ntl 


det (A) = A det(A11) = I] Ay. 


1=1 


Let z be the zero row vector. Thus we have that 


z Oz 
Qr+1 Qr+1 Qr+1 
an an an 


Applies Theorem 4.3 to each row. Thus we have hat 


kay, ay ay 
det | "42 | = kaet| "4? | =... = 4" aet | 
kan kan an 


By the previous exercise the equality holds only when n is even or F has 
characteristic 2. 
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27. 


28. 


29. 


30. 


If A has two identical columns, then the matrix A is not full-rank. By 
Corollary after Theorem 4.6 we know the determinant of A should be 0. 


The matrix £, can be obtained from J by interchanging two rows. So 
by Theorem 4.5 the determinant should be -1. The matrix E2 is upper 
triangular. By Exercise 4.2.23 the determinant should be c, the scalar by 
whom some row was multiplied. The matrix E3 has determinant 1 by 
Theorem 4.6. 


The elementary matrix of type 1 and type 2 is symmetric. So the statement 
holds naturally. Let E be the elementary matrix of type 3 of adding k times 
of the i-th row to the j-th row. We know that E* is also an elementary 
matrix of type 3. By the previous exercise we know that this kind of 
matrix must have determinant 1. 


We can interchange the i-th row and the (n+ 1 -7)-th row for all 7 = 
i yee L3 [| Each process contribute —1 one time. So we have that 


det(B) = (-1)!#! det(A). 


4.3. Properties of Determinants 


1. 


2. Since we have the condition ter 


(a) No. The elementary of type 2 has determinant other than 1. 

(b) Yes. This is Theorem 4.7. 

(c) No. A matrix is invertible if and only if its determinant is not zero. 
) 


(d) Yes. The fact that n xn matrix A has rank n is equivalent to the 
fact that A is invertible and the fact that det(A) # 0. 


(e) No. We have that det(A‘) = det(A) by Theorem 4.8. 
(f) Yes. This is the instant result of Theorem 4.4 and Theorem 4.8. 


(g) No. It still require the condition that the determinant cannot be 
Zero. 


(h) No. The matrix M;, is the matrix obtained from A by replacing 
column k of A by 6. 


a11 
a21 


a12 
= 4122 — A422, + 0, we can 
a22 


1The symbol |2] means the greatest integer m such that m< a. 
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use Cramer’s rule and get the answer. 


wo 2) 
C1 bz daz bi ag2-a12b2 , 
ie a11 aj42 11422 12421 
421 422 
ao 4 
ro = az1 be @yibi—baaa1 
que Qi1 a\2 11422 12421 
a21 422 
3. The answer is (%1,%2,%3) = (4,-3,0). 
4. The answer is (21, %2,2%3) = (-1,-8,-2). 
5. The answer is (21, 22,23) = (—20,-48, -8). 
6. The answer is (21,22,23) = (—43,-109,-17). 
7. The answer is (#1, 22,273) = (42,110, 18). 
8. By Theorem 4.8 we know that det(A‘) = det(A). So we can write Theorem 


4.3 into column version by “The determinant of an nxn matrix is a lineaer 
function of each column when the remaining columns are held fixed.” . 


9. By Exercise 4.2.23, the determinant of an upper triangular matrix is the 
product of its diagonal entries. So it’s invertible if and only if all its 
diagonal entries are nonzero. 


10. If M is nilpotent, we say M* = O for some positive integer k. So we have 
0 = det(O) = det(M*) = (det(M))*. 
This means that det(J/) must be zero. 


11. By Exercise 4.2.25 we have det(-M) = (-1)"det(M). By Theorem 4.8 
we have det(M‘) = det(M). So the conclusion is that 


(-1)" det(M) = det(M). 


If n is odd, we can conclude that det(/) = 0 and hence M is not in- 
vertibl?] If n is even, we cannot say anything. For example, the matrix 


( | is invertible while the matrix 2x 2 zero matrix Oz is not invertible. 


12. This is an instant result by 


1 = det(Z) = det(QQ") = det(Q) det(Q*) = det(Q)’. 


It should be remarked that “—a = a implies a = 0” holds only when the field has charater- 
istic other thant 2. In this question the field C has characteristic zero 
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13. 


14. 


15. 


16. 


17. 


18. 


(a) For the case n = 1 we have 
det(M) = My = det(M) 


By induction, suppose that det(M) = det(M) for Misakxk 
matrix. For a (n+ 1) x (n+1) matrix M, we have 


det(M) = 3.7 = 1"(-1)"7 My; det(M;j) 
= S\j = 1"(-1)**7 My; det(M,j) = det(M). 
(b) This is an instant result by 


1 =| det(Z)| = | det(QQ")| = | det(Q)|| det(Q*)| = | det(Q)/’. 


The set @ is a basis if and only if @ is an independent set of n elements. 
So this is equivalent to the set of columns of B is independent and hence 
B has rank n. And all of them are equivalent to that B is invertible and 
hence det(B) # 0. 


Two matrix A and B are similar means A = C7!BC. So we get the 
conclusion 


det(A) = det(C7'BC) = det(C™') det(B) det(C) = det(B). 
Since the fact 
1 = det(Z) = det(A) det(B) 
implies det(A) and det(B) cannot be zero, we know A is invertible. 
By Exercise 4.2.25 we have 
det( AB) = (-1)”" det(A) det(B). 


This means det(A) and det(B) cannot be invertible simultaneously. So A 
or B is not invertible. 


For the first case, let A be a matrix of type 2 meaning multiplying the 
i-th row by a scalar c. We have det(A) = c by Exercise 4.2.28. And since 
determinant is linear function of the i-th row when other rows are held 
fixed, we have 


det( AB) = cdet(B) = det(A) det(B). 
For the second case, let A be a matrix of type 3 meaning adding c times 
the i-row to the j-th row. We have det(A) = 1 by Exercise 4.2.28. And 


since determinant will not change when we adding some times the 7-row 
to the j-th row, we have 


det( AB) = det(B) = det(A) det(B). 
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19. 


20. 


215 


22. 


Since the transpose of a lower triangular matrix is an upper triangular 
matrix, we have that the determinant of a lower triangular matrix is the 
product of all its diagonal entries. 


We can expand the matrix by the n-th row and then by (n- 1)-th row 
inductively. So we have that det(W/) = det(A). Similarly, if we expand 
the matrix below by the first row and the second row inductively, we get 
the identity 


avt(4 | = det(D). 


First, if C is not invertible, the set of row vectors of C’ is not independent. 
This means the set of row vectors (O C) is also not independent. So it’s 
impossible that M/ has n independent rows and hence it’s impossible that 
M is invertible. The conclusion is that if C is not invertible, we have 


det(A) det(C) = det(A)0 = 0 = det(M). 
Second, if C is invertible, we have the identity 
Ir O\(A B\ (A B 
(6 c)(o é)-(6 7}: 
So we get the identity 
det(C™*) det(M) = det(A) 


and hence 
det(M) = det(A) det(C). 


(a) We have that 


T(1) = ley t+ leg t+ + lend 
T(@) = coe, + cyeg + + Cnens1 
T(a") = chevt+chegt--+Crens, 
where {€1,€2,..-,€n+1} is the standard basis for F"*!. So we get the 


desired conclusion. 


(b) By Exercise 2.4.22 T is isomorphism and hence invertible. So the 
matrix M is also invertible and hence det(/) # 0. 
1 Co 
(; |) She co: 
Suppose the statement of this question holds for n = k - 1, consider 
the case for n = k. To continue the proof, we remark a fomula first 
below. 


(c) We induction on n. For n = 1, we have det 


ak ~y S (x - y)(a*t + ah ey 4 et yh) 
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For brevity we write 


pay k) =a + ak yt ety 


k-1 


Now to use the induction hypothesis we can add —1 time the first 
row to all other rows without changing the determinant. 


1 © ee CO 
era eae 
1 j e Cr 
1 Co red a co 
0 


mie C1 : co (a1 - co)PCC1 0,2) 


(c = Co) p(C1, C0, ) 


0 Cn — CO (Cn — Co) P(En; Co, 2) (Cn — €Co)P(En, Co, 2) 


n 1 p(C1,€0, 2) P(C1,€0,) 
= [| (c; - co) det : : 
jel 


P(Cn; Co, 7) 

Now we write e; = (ci,ch,...,c,)* for i = 0,1,...n-1. So the de- 
terminant of the last matrix in the equality above can be written 
as 


1 P(En, Co, 2) 


det (€0 €1+Cp€o0 €2 + Coe + CRE1 €n-1 + Co€n_2g to + che) 


= det (€0 €y €2+Ce1 €n-1 + Co€n-2 + -) 


= det (€0 e1 e2 €n-1 + Coen-2 + +) 


ad det (€0 Ey €2 en-1) : 


And by induction hypothesis, the value of it would be 


I] (G:—G); 


lsi<j<n 
Combine two equalities above we get the desired conclusion. 


23. (a) We prove that rank(A) > & first. If rank(A) =n, then the matrix A 
has nonzero determinant and hence the interger k should be n. Now 
if rank(A) =r <n, we prove by contradiction. If & >rank(A), we can 
find a kx k submatrix B such that the determinant of it is not zero. 
This means the set 

S= {v1, v2, . .. UR} 


of columns of B is independent. Now consider the k x n submatrix 
C of A obtained by deleting those rows who were deleted when we 
construct B. So S is a subset of the set of columns of C’. This means 


k > rank(C) < min{n,k} =k 
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and hence rank(C) = k. So this also means that the set of the k rows 
of C independent. And thus the matrix A contains k independent 
rows and hence rank(A) > k, a contradiction. 

Conversely, we construct a rxr submatrix of A, where r is rank( A), to 
deduce that rank(A) < k. Since rank of A is r, we have r independent 


rOWS, Say U1,U2,.--.,Ur. Let D be the r x n submatrix such that the 
i-th row of D is u;. Since the set of rows of D is independent, we 
have that 


r<D<min{r,n}=r 
and hence rank(D) = r. Similarly we have w 1, we2,...,w, to be the 
r independent columns of D. And si-similarly we can construct a 
rxr matrix EF such that the i-th column of FE is w;. Since FE is a 
r xr matrix with r independent rows, we have rank(£) = r. This 
complete the proof. 


(b) See the second part of the previous exercise. 


24. We use induction to claim that 


0 . 
det(A+tI)=t"+ >> a;t’. 
w=n-1 
For n = 1, it’s easy to see that det(A+tI) = t+ ag. Suppose that the 
statement holds for n = k- 1, consider the case for n = k. We can expand 
the matrix and get 


t O O - O ao 
St. 2 the ate dag 
det} 0 -l t + O ag 
0 O O ++ -L api tt 
t 0 0 ay -l1 ¢t «+ O 
= tdet é 0 i + (-1)1*¥ ao det 2 
0 0 -1 a1 0 0 t 


25. (a) Just expand along the k-th column. 


(b) It’s better not to use a great theorem, such as Cramer’s rule, to killa 
small problem. We check each entry one by one. First, we have that 


ye, AjeCjk = det (A) 
k=1 
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and so the j-th entry of the left term is det(A). Second, for i # 7, we 
construct a matrix B; by replacing the j-th row of A by the i-th row 
of A. Since B; has two identity rows, we have that det(B;) = 0 for 
alli #7. Now we can calculate that 


>> Aincjn = ¥ Byrcgn = det(B) = 0, 
k=1 k=1 
for all i + 7. So we get the desired conclusion. 


(c) Actually this matrix C is the classical adjoint of matrix A defined 
after this exercise. And this question is an instant result since 


C11 C21 Cn1 

AC=A C12 ©22 Cn2 

Cln Con Cnn 

det(A) 0 oe 0 

waf 9 eet) 0 
0 0 «+ det(A) 


by the previous exercise. 
(d) If det(A) # 0, then we know A is invertible. So we have 


A = A A[det(A)]*C = [det(A)] °C. 
26. (a) We have that : 
c11 = (-1)? det((A)11) = Aga, 
C1 = (-1)° det((A)12) = — Aon, 
coy = (-1)° det((A)a1) = - Ate, 
Ca9 = (-1)* det((A) a2) = Air. 


So the adjoint of matrix A is 
Ao. —Aip2 
-Ay Au} 


(b) The adjoint of that matrix is 
16 0 0 
0 16 Of. 
0 O 16 
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27. 


(c) The adjoint of that matrix is 
(d) The adjoint of that matrix is 
(e) The adjoint of that matrix is 
-34 
4 
10 + 162 
(f) The adjoint of that matrix is 


(g) The adjoint of that matrix is 


(h) The adjoint of that matrix is 


10 40 0 
0 -20 Of]. 
0 0 -8 
20 -30 20 
0 15 -24]. 
0) 0 12 
0 0 
-1+%2 0 / 
-5-31 343i 
6 22 12 
12 -2 24 |. 
21 -38 -27 
18 28 -6 
-20 -21 37 |]. 
48 14 -16 
-8+7 -1+2i 
9 - 61 383i. 
-3 3-7 


—1 
1-5i 
-1+12 


(a) If A is not invertible, we have AC = [det(C)|J = O. It’s impossible 
that C is invertible otherwise A = C-1O = O. But the adjoint of the 
zero matrix O is also the zero matrix O, which is not invertible. So 
we know that in this case C is not invertible and hence det(C) = 0= 
[det(A)]""!. Next, if A is invertible, we have, by Exercise 4.3.25(c) 


that 


det(A) det(C) = det([det(A)]J) = [det(A)]” 


So we know that 


det(C) = [det(A)]"~! 


since det(A) + 0. 
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(b) This is because 
det(At;;) = det(A‘,) = det(A). 


(c) If A is an invertible upper triangular matrix, we claim that c;; = 0 
for alli, 7 with 7 >. For every i, 7 with 7 > 7, we know that 


Cig = (-1)'*7 det (Aj;). 


But Ai; is an upper triangular matrix with at least one zero diagonal 
entry if i>. Since determinant of an upper triangular matrix is the 
product of all its diagonal entries. We know that for 7 > 7 we have 
det (Aj;) = 0 and hence c;; = 0. With this we know that the adjoint 
of A is also a upper triangular matrix. 


28. (a 


wa 


For brevity, we write 


v(y(t)) = W@), 9,9 () 
and 
vilt) = (y(t), O.-yPO)" 
Since the defferential operator is linear, we have 
u((x + cy)(t)) = v(a(t)) + cv(y(t)). 
Now we have that 
[T(a + cy)](t) = det (v((w + ey)(t)) v(t) v2(t) + un (t)) 
= det (v(x(t)) + cv(y(t)) vit) w(t) + vn(t)) 
=[T(x)]@) + [T(y)]@) 


since determinant is a linear function of the first column when all 
other columns are held fixed. 


(b) Since N(T) is a space, it enough to say that y; ¢ N(T) for all 7. But 
this is easy since 


[T(y) ](t) = det(v;(t), v1 (t),...,0n(t)) = 0. 


The determinant is zero since the matrix has two identity columns. 


4.4 Summary—Important Facts about Determi- 
nants 


1. With the help of Theorem 4.8 the statement in row version or column 
version is equivalent. So we won’t mention it again and again. 
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(a) Yes. It’s Exercise 4.3.8. 


(b) Ur...No! It’s wise to check whether there are any two identity rows 
or two identity columns first. How do we know what is wise? 


Yes. See the Corollary after Theorem 4.4. 

No. The determinant should be multiplied by —1. 
No. The scalar cannot be zero. 

Yes. Watch Theorem 4.6. 

Yes. This is Exercise 4.2.23. 

No. Read Theorem 4.8. 

(i) Yes. Peruse Theorem 4.7. 

(j) Yes. Glance the Corollary after Theorem 4.7. 

(k) Yes. Gaze the Corollary after Theorem 4.7. 


The determinant should be 22. 
The determinant should be 4 + 2%. 
The determinant should be -3. 
The determinant should be 154. 


The determinant should be 10. 
The determinant should be —-28 - 7. 
The determinant should be 17 - 32. 
The determinant should be 95. 
The determinant should be -100. 


5. See Exercise 4.3.20. Remember to write it down clearly one more time. 
The given proof there is simplistic. 


6. Ur...look Exercise 4.3.21. 
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4.5 A Characterization of the Determinant 


1. (a) No. For example, we have det(2I) # 2det(I) when I is the 2 x 2 
identity matrix. 


(b) Yes. This is Theorem 4.3. 

(c) Yes. This is Theorem 4.10. 

(d) No. Usually it should be 6(B) = -6(A). 

(e) No. Both the determinant and the zero function, 6(A) = 0, are n- 


linear function. 


(f) Yes. Let v1, U1, U2,..-,Un are row vectors and c are scalar. We have 
U1 + CUY Ui U1 
5] “P| =0=0+c-0=6]"? [+ ca]? 
oe Un Un 


The cases for other rows are similar. 
2. A 1-linear function is actually a linear function. We can deduce that 
6 (x) = a6 (1) = an, 


where a is defined by (1). So all the functions must be in the form 6 (zx) = 
ax. 


3. It’s not a 3-linear function. We have that 


P10 
5[0 1 O]=k#=26(I3) = 2k. 
001 


4. It’s not a 3-linear function. We have that 


20 0 
5{0 1 O}=14=26(J3) =2. 


0 0 1 


5. It’s a 3-linear function. We have that when the second and the third rows 
are held fixed, the function would be 


Ay, Ai Ais 
6} A21 Ag Ags | = (Air, Ai2, Ai3) - (A23A32, 0,0), 
Asi As. Ass 
a inner product function. So 6 is linear for the first row. Similarly we can 
write 
Ai Aiz Ais 
bd] Aor Age Aas 


A31 A32 A33 
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10. 


11. 


= (Aoi, Aza, Aa3) - (0,0, A11 A32) = (A31, A32, A33) - (0, A123, 0). 


So 6 is a 3 linear function. 


. It’s not a 3-linear function. We have that 


20 0 
5{0 1 0} =44=26(J3) =6. 
001 


. It’s a3 linear function. We could write 


Ay, Ai2 Aig 
6] Aor Age Aog3 | = (Arr, A12, Aig) - (A21A32, 0,0) 
A3zi Az32 A33 


(Agi, A22, Ao3) ; (Aj, A32, 0, 0) = (A31, Ago, A33) . (0, Ay; -A21,0) 
and get the result. 


. It’s not a 3-linear function. We have that 


1 0 0 1 0 0 
670 2 Of=1#=26)0 1 OF =2. 
1 1 0 1 1 0 


. It’s not a 3-linear function. We have that 


20 0 
5{0 1 O}=44=26(J3) =2. 
001 


It’s a 3 linear function. We could write 
[i Aj. Ais 
6 


Ag, Age ‘a = (Aji, Aig, A13) - (A22A33 - A + 21A32,0,0) 
A31 Az32  A33 


= (Agi, Ave, Ao3)-(—A11 A432, Ai1 A33, 0) = (A31, A32, A33)-(0, Art A421, Ai1 A22) 
and get the result. 


Corollary 2. Since 6 is n-linear, we must have 6(A) = 0 if A contains one 
zero row. Now if M has rank less than n, we know that the n row 
vectors of M are dependent, say ui, U2,...,Un. So we can find some 
vector u; who is a linear combination of other vectors. We write 


Uy = ae # 10; Uj. 


By Corollary 1 after Theorem 4.10 we can add —a; times the j-th 
row to the i-th row without changing the value of 6. Let M’ be the 
matrix obtained from M by doing this processes. We know M’ has 
one zero row, the i-th row, and hence 6(M5(M’) = 0. 
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12. 


13. 


Corollary 3 We can obtain FE, from J by interchanging two rows. By 
Theorem 4.10(a) we know that 6(£,) = -d(). Similarly we can 
obtain £2 from I by multiplying one row by a scalar k. Since 6 is 
n-linear we know that 6(£2) = kd(1). Finally, we can obtain E3 by 
adding & times the i-th row to the j-th row. By Corollary 1 after 
Theorem 4.10 we know that 6(£3) = 6(Z). 


If A is not full-rank, we have that AB will be not full-rank. By Corollary 
3 after Theorem 4.10 we have 


5(AB) = 0 = 6(A)d(B). 


If A is full-rank and so invertible, we can write A = E,---E2E, as product 
of elementary matrices. Assuming the fact, which we will prove later, that 


5(EM) = 6(E)6(M) 


for all elementary matrix F and all matrix M holds, we would have done 
since 
0(AB) = 0(E,:- FoF, B) = 6(E,)6(Ex-1--- 2 EB) 


= = 0(B.)---0(E2)d(E1)d(B) 
= 0(E,:-F2E))6(B) = 6(A)6(B). 
So now we prove the fact. First, if E is the elementary matrix of type 
1 meaning interchangine the i-th and the j-th rows, we have EM is the 


matrix obtained from M by interchanging the 7-th and the j-th rows. By 
Theorem 4.10(a) we know that 


6(EM) = -6(M) = -5(1)6(M) = 6(E£)5(M). 


Second, if E is the elementary matrix of type 2 meaning multiplying the 
i-th row by a scalar k, we have EM is the matrix obtained from M by 
multiplying the i-th row by scalar k. Since the function 6 is n-linear, we 
have 

(EM) = ké(M) = ké(1)6(M) = 6(F)6(M). 


Finally, if EF is the elementary matrix of type 3 meaning adding k times 
the i-th row to the j-th row, we have EM is the matrix obtained from 
M by adding k times the 7-th row to the j-th row. By Corollary 1 after 
Theorem 4.10, we have 


6(EM) = 6(M) = 6(1)6(M) = 6(£)6(M). 
This complete the proof. 


Since the fact det(A‘) = det(A) and det is a 2-linear function, the result 
is natural. 
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14. 


15. 
16. 


We could write 


(4 Aj2 


= (Au, Aj2) > (Ag2a + Aoib, Asoc + Aid) 
Ao, Age 


= (Agr, Age): (Aird + Ajod, Aria + Ajoc) 
and get the desired result since inner product function is linear. 


For the converse, fixed one n-linear function 6 and let 


1 O 1 0 
a=a(5 1).0=6(; Ir 


0 1 0 1 
cna(2 t)ens(® 2) 
Now we must have 
Ay, Ai 1 0 0 1 
é6 = A116 Aj26 
be | ut te ba) my be 4) 


1 0 1 0 0 1 0 1 
= An(An{ i + Ans (4 ') + Ara(An5{\ ) + Ans (5 i) 


= Ay, Ag2a + A,,Ao1b Be Ajo Asoc + Aj 9Ao1d. 
Wait 


Fixed an alternating n-linear function 6. Let k be the value of d(I). We 
want to chaim that 
6(M) = kdet(M). 


First we know that if M has rank less than n, then 6(M) = 0 = det(M) 
by Corollary 2 after Theorem 4.10. So the identity holds. Second if M is 
full-rank, we can write M = E,---E2E I as product of elementary matrices 
and identity matrix J. And it’s lucky now I can copy and paste the text 
in Exercise 4.5.12. 


This time we will claim that 
6( EA) = det(E)6(A) 


for all elementary matrix FE and all matrix A. First, if EF is the elementary 
matrix of type 1 meaning interchangine the i-th and the j-th rows, we have 
EM is the matrix obtained from A by interchanging the i-th and the j-th 
rows. By Theorem 4.10(a) we know that 


5(EM) = -6(A) = det(E)6(A). 


Second, if E is the elementary matrix of type 2 meaning multiplying the 
i-th row by a scalar k, we have EM is the matrix obtained from M by 
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multiplying the i-th row by scalar k. Since the function 6 is n-linear, we 
have 
6(EM) = k6(A) = det(£)d6(A). 


Finally, if FE is the elementary matrix of type 3 meaning adding k times 
the i-th row to the j-th row, we have EM is the matrix obtained from 
M by adding k times the 7-th row to the j-th row. By Corollary 1 after 
Theorem 4.10, we have 


6(EM) = 6(A) = det(E£)6(A). 
This complete the proof since 
6(M) = 0(E,:-E2 E11) = det(E,)--- det (£2) det(£1)d(1) 
= kdet(E,)---det(F2) det( £1) = kdet(M). 
17. Recall the definition 
(61 + 62)(A) = 61(A) + 52(A) 


and 
(kd) (A) = kd(A) 


. For brevity, we write 6’ for 6, + 69 and 6” for ké. Now prove that both 
6’ and 6” is n-linear. Check that 


Uz, + CU, Uz, + CU, Uz, + CU, 
c| Jah] “? [+e] “ 
de Un Un 
U1 V1 U1 U1 
= 61 - + CO} - + 62 "2 + 69 ve 
hi Un Un Un 
UL UL 
= 6] "? | +08"| °? 
Un Un 
Also check that 
uy t+ cuz Uy + CUY 
6" bee ns oe 
Un Un 
Ut V4 U1 U1 
= kd| "2 14 cko| "2 | = 57] 92 1 4+ 05" | 


18. 


19. 


20. 


So both 6’ and 6” is linear function for the first row when other rows are 
held fixed. For the cases on other rows are similar. 


Let the zero element be the zero n-linear function, 6(A) = 0. Thus it can 
be checked that all the properties of vector space hold by the properties 
of field. 


If M has two identical rows, then the matrix M’ obtained from M by 
interchanging the two rows would be the same. So we have 


5(M) = 5(M’) = -6(M). 


When F does not have characteristic two, the equality above means 6(/) = 
0. 


Let 6 be the 2-linear function in Exercise 4.5.15 with a=b=c=de=l1. 
Thus we have that 


a b c od c od 
if? j) =e ade bev bd =5(¢ s)=-5 (5 ) 


The final equality holds since F has characteristic two. But now we have 


1 O 
al p)rte0 
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Chapter 5 


Diagonalization 


5.1 Eigenvalues and Eigenvectors 


1. 


(a) No. For example, the identity mapping Jy has two eigenvalues 1, 1. 
(b) Yes. If we have (A —- AI)v = 0, we also have 


(A-XI)(cv) = c(A- AT)v = 0 


for all ce R. Note that this skill will make the statement false when 
the field is finite. 


-l 
(c) Yes. For example, the matrix means the rotation through 


1 0 
the angle 5. And the matrix has no real eigenvalue and hence no 
real eigenvector. 
(d) No. See definition. 
(e) No. For the matrix Iz, the vectors (1,0) and (2,0) are all eigenvectors 
but they are parallel. 


(f) No. The matrix 4 has only two eigenvalues 1 and —-1. But the 


1 
-1 0 
sum 0 is not an eigenvalue of the same matrix. 


(g) No. Let P be the space of all polynomial and T be the identity 
mapping from P to P. Thus we know 1 is an eigenvalue of T. 

(h) Yes. That a matrix A is similar to a diagonal matrix D means there is 
some invertible matrix P such that P-! AP = D. Since P is invertible, 
P =[I]® for some basis a ,where ( is the standard basis for F". So 
the first statement is equivalent to that A is diagonalizable. And the 
desired result comes from Theorem 5.1. 


(i) Yes. If A and B are similar, there is some invertible matrix P such 
that P-'AP = B. If Av = Xv, we have 


B(P™'v) =AP™+v. 
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And if Bu = Av, we have A(Pv) = Pu. 


(j) No. It usually false. For example, the matrices ( ) and a i 


G90 a-6 9 


But the eigenvector (1,0) of the first matrix is not a eigenvector of 
the second matrix. 


are similar since 


(k) No. The vectors (1,0) and (0,1) are eigenvectors of the matrix 


-1 Oy 


same matrix. 


( : 5) But the sum of them (1,1) is not an eigenvector of the 


2. Compute [T']g as what we did in previous chapters. If that matrix is 
diagonal, then is a basis consisting of eigenvectors. 


(a) No. [T]9 = @ i 


(b) Yes. [T'] = Fi a 


0 0 -l 
0 O 8 
(d) No. | 0 -2 O}. 
-4 0 0 
-1 1 0 
0 -1 1 #O 
(e) No. [T]g = 0 0 -1 O 
0 0 O =-1 
-3 0 0 0 
0 1 0 0 
(f) Yes. 0 01 0; 
0 00 1 


3. Calculate the characteristic polynomial of A and find the zeroes of it to 
solve (i). Find the nonzero vector v such that (A - AI)v = 0 to solve (ii). 
For (iii) and (iv) just follow the direction given by the textbook. 


(a) The characteristic polynomial is t? — 3t- 4 and the zeroes of it are 4 
and -1. For eigenvalue 4, we may solve (A-4I)x=0. There are infinite 
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a 


nN 


wa 


solutions. Just pick one from them, say (2,3). Similarly we can find 
the eigenvector corresponding to —1 is (1,-1). Pick 


B= {(2,3), (1, aay 


o-ing=(5 3): 


where a is the standard basis for R?. Then we know that 


and 


D=O7A0= ( ) 


The characteristic polynomial is — (t- 3) (¢ - 2) (t- 1) with 3 zeroes 
1, 2, and 3. The corresponding eigenvectors are (1,1,-1), (1,-1,0), 
and (1,0,-1). The set of these three vectors are the desired basis. 
And we also have 


and 


1 0 0 
D=Q"'AQ=|0 2 Of. 
Oy Or 23 


The characteristic polynomial is (t - 1) (t + 1) with two zeroes —1 and 
1. The corresponding eigenvectors are (1,-i-1) and (1,-i+1). The 
set of these two vectors are the desired basis. And we also have 


1 1 
a-(, oe 


D=Q74Q = & | 


and 


The characteristic polynomial is —(t - 1) with zeroes 0, 1, and 1. 
The corresponding eigenvectors are (1,4,2), (1,0,1), and (0,1,0). 
The set of these three vectors are the desired basis. And we also 
have 


and 


4. Follow the process of the previous exercise. 


(a) B= {(3,5),(1,2)} and [Tp - ic i) 
2 0 0 
(b) B= {(2,0,-1), (1, 2,0), (1,-1,-1)} and [T], = f -1 ) 
0 
2 O 0 
(c) 8 = {(1,-2,-2), (2,0,-1), (0,2,1)} and [T], = f -1 0 } 
0 


(d) B={2x+3,e+4+2} and [T], = G 5) 


0 0 0 
(e) B= {x-3, 4a? -132-3,x+1} and [T],=]0 2 O}. 
00 4 
1 0 0 0 
0 10 0 
3 2 : 
(f) B= {2° -8,0?-4,0-2,0} and [T]p=|4 9 1 of: 
00 0 3 
-1 00 0 
0 10 0 
2 3 _ zZ 
(g) B= {(1,e-1,30? -2,20°+6e-7} and [T]s=| 9 9 9 of: 
0 0 0 8 
-1 00 0 
1 0 1 0\ (0 1) [0 O 0 10 0 
0 00 1 
-1 0 0 
1 oO) /O 1 1 0O\ fo 1 0 -1 0 
(i) B= ( ap mE (0 i)} and [T]e = 0 0 1 
0 0 0 


a) 
_——— Ee ooo 


5 0 
0) a= ((j anes Ane BG 5) and (T= aca 
0 O O 


5. By definition, that v is an eigenvector of T corresponding to \ means 
Tv = Xv and v # 0. Hence we get 


Tv —- rv = (T-AI)v =0 


and v € N(T - AI). Conversely, if v + 0 and v ¢ N(T - XI), we have 
Tv = Av. It’s the definition of eigenvector. 
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6. We know that 


is equivalent to 


7. (a) We have that 


(b) 
(c) 


(d) 


(e) 


8. (a) 


and 
By-1 _ 
({7]5)" = L7Y5- 
So we know that 


det ([T']s) = det([Z]}) det([T],) det([Z]},) = det([T],). 


It’s the instant result from Theorem 2.18 and the Corollary after 
Theorem 4.7. 


Pick any ordered basis 8 and we have 
det(T~*) = det([Z~*]g) = det(([T]g)*) = det([T]g)"* = det(T)*. 


The second and the third equality come from Theorem 2.18 and the 
Corollary after Theorem 4.7. 


By definition, 
det(TU) = det([TU]s) = det([T][U]s) 
= det([T],) det([U]g) = det(T) det(U). 
We have 
det(T - Aly) = det([T - AI]g) = det((T]g - A[Z]) = det([T]g - AZ). 


By previous exercises, we have that T is invertible if and only if 

det(T’) #0. And the fact det(7’) # 0 is equivalent to the fact N(T - 

OL) = {0}, which is equivalent to that zero is not an eigenvalue of T 

by Theorem 5.4. 

Since T = (T~+)~1, it’s enough to prove only one side of the statement. 

If \ an eigenvalue with eigenvector v, we have Tv = Xv and so T7!v = 

d-!v. This means 7! is an eigenvalue of Tt. 

(a) A matrix M is invertible if and only if 0 is not an eigenvalue of 
M. 

(b) Let M be an invertible matrix. We have is an eigenvalue of 
M if and only if \~+ is an eigenvalue of Mt. 

First, if M is invertible, then there’s no vector v such that Mv = 0v = 

0. So 0 is not an eigenvalue of WM. If 0 is not an eigenvalue of M, 

then v = 0 is the only vector sucht that Mv = 0. This means that M 

is injective and so invertible since M is square. Second, it’s enough 

to prove one side of that statement since M = (M~')~'. And if we 

have Mv = Xv, then we have M~'v = \"!v. 
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9. This is directly because the determinant of an upper triangular matrix is 
the product of it’s diagonal entries. 


10. 


11. 


12. 


13. 


(a) 
(b) 


(c) 


What did the author want to ask? Just calculate it! 


Just calculate it and get the answer (A-t)", where n is the dimension 
of V. 


We already have that for any ordered basis 6, [AIv ]g = AJ, a diagonal 
matrix. So it’s diagonalizable. By the previous exercise we also have 
that the only eigenvalue is the zero of the characteristic polynomial, 
X. 


It’s just because 
AaB MBS 


for some invertible matrix B. 


Let M be the matrix mentioned in this question and . be that only 
eigenvalue of M. We have that the basis to make M diagonalizable 
if consisting of vectors v; such that (M - AI)u; = 0. This means 
(M - XI)v = 0 for every v since {v;} forms a basis. So M must be 
Al. 


It’s easy to see that 1 is the only eigenvalue of the matrix. But since 


nullity of 
1 1) f1 0) _/0 1 
0 1 0 1) \O O 


is one, we can’t find a set of two vector consisting of eigenvectors 
such that the set is independent. By Theorem 5.1, the matrix is not 
diagonalizable. 


Use the fact if A = P-'BP, then 
det(A — AI) = det(P7(A- AI)P) 
= det(P1AP - PIP) = det(B-AlI). 
Use the result of the previous exercise and the fact that each matrix 


representation of one linear operator is pairwisely similar to each 
other. 


Since the diagram is commutative, we know that 
T(v) = o5' (T(s(v))) = $5 (Ada(v)) = Av. 


One part of this statement has been proven in the previous exercise. 
If og (y) is an eigenvector of T corresponding to A, we have 


Ay = b3(T(95' (y))) = $8 (Aba(y)) = AY. 
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14. Use the fact 


15. 


16. 


17. 


18. 


(a) 


(b) 
(a) 


(b) 


det(A — AI) = det((A — AT)*) = det(A® - XZ). 
If we have T(v) = Av for some v, then we also have that 
Te Qyet™ | Og) sar” Wuyss extn 


You can just replace the character T’ by the character A. 


Just as the Hint, we have 
tr(P7' AP) = tr(PP™'A) = tr(A). 


We may define the trace of a linear operator on a finite-dimensional 
vector space to be the trace of its matrix representation. It’s well- 
defined due to the previous exercise. 


If T(A) = A’ = AA for some \ and some nonzero matrix A, say 
Aj; #0, we have 


and 

Aji = VAG; 
and so 

Ajj = Ajj. 


This means that can only be 1 or -1. And these two values are 
eigenvalues due to the existence of symmetric matrices and skew- 
symmetric matrices. 


The set of nonzero symmetric matrices are the eigenvectors corre- 
sponding to eigenvalue 1, while the set of nonzero skew-symmetric 
matrices are the eigenvectors corresponding to eigenvalue —1. 


outa vo (! 8),(0 2), 9,1 9) 


Let £;; be the matrix with its 7j-entry 1 and all other entries 0. Then 
the basis could be 


{Fix }iet2,....n U {Big + Byihing U {Big — Ejihin;. 


If B is invertible, we have B~! exists and det(B) +0. Now we know 
that 
det(A + cB) = det(B) det(B"'A+ cl), 


a nonzero polynomial of c. It has only finite zeroes, so we can always 
find some c sucht that the determinant is nonzero. 
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19. 


20. 


21. 


(b) Since we know that 


tee : me 
ec lt+e 


1 1 0 0 
take A=(j ‘ and B=(\ i 


Say A= P"!BP. Pick V =F", T = Ly, and £ be the standard basis. We 
can also pick y such that P = [Ee That is, y is the set of the column 
vectors of P. 


From the equation given in the question, it’s easy that f(0) =a. And by 
definition, we also know that f(t) = det(A- tJ) and so f(0) = det(A). So 
A is invertible if and only if ag = det(A) # 0. 


(a) We should use one fact that if B is a matrix with number of nonzero 
entries less than or equal to k, then we have det(A-tB) is a poly- 
nomial of t with degree less than or equal to k. To prove this, we 
induct on both k and the size of matrix n. If k = 0, we know that B 
is a zero matrix and det(A-tB) is constant. For n = 1, it’s easy to 
see that degree of det(A-tB) is equal to 1, which will be less than 
or equal to k if k > 1. Suppose the hypothesis holds for n = k-1. For 
the case n = k, we may expand the determinant along the first row. 
That is, 


det(A-tB) = > (-1)'*9(A-tB)1; det(A-tB,,). 


j=l 


If the first row of B is all zero, then det(A-tB1;) is a polynomial 
with degree less than or equal to k and (A-tB),,; contains no t 
for all j. If the first row of B is not all zero, then det(A-tBy;) 
is a polynomial with degree less than or equal to k - 1 and each 
(A-tB),; contains t with degree at most 1. In both case, we get 
that det(A-tB) is a polynomial with degree less than or equal to k. 
Now we may induct on n to prove the original statement. For n = 1, 
we have f(t) = Ai; -t. For n = 2, we have f(t) = (Ai — t)(Az2 - t) 
Aj2A21. Suppose the hypothesis is true for n = k-1. For the case 
n=k, we expand the determinant along the first row. That is, 


= k a “ 
det(A - tL) = (Ai, - ¢) det((A - t1)11) + be (-1)°%? det((A - t2)15. 
j=2 
By the induction hypothesis, we know that 
det((A - #1) 11) = (Ag2 - t)(Ag3 - t)-( Ane - t) + v(t), 


where p(t) is a polynomial with degree less than or equal to k - 3, 
and 


(-1)**9 det((A = tl)1; 
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22. 


wa 


wa 


is a polynomial with degree less than or equal to k-2. So it becomes 
(Aiit)(Az2-t)--(Ann=t) +(Anr-t)p(t)+ 9 (-1)" det((A = HD) 5, 
j=2 


in which the summation of the second term and the third term is a 
polynomial with degree less than or equal to n- 1. 


By the previous exercise, we know that the coefficient of t’-' comes 
from only the first term 


(Aji — t)( Age - t)-*(Ann - t) 


and it would be 


(rye > Ais = tr(A). 


Use the notation x and 4 that in this question. In Exercise 5.1.15(a) 


we already have 
T(x) =A a. 


And it’s also easy to see that if Aw = A,” and Ba = A2x, we'll have 
(A+ B)x = AL + AQ. 


So we get the desired result. 


(b) Shit! It’s not funny! Just change T into A. 
(c) Just calculate that 


#4)=(5 a9} 


10 ()-*6) 


Also check that = 4 and g(4) = 29. 


and 


23. If T is diagonalizable, there is a basis (6 consisting of eigenvectors. By the 


24. 


previouse exercise, we know that if T(x) = Aq, 


f(T)(v) = f(A)v = 0v = 0. 


This means that for each v € 6, we have f(T)(v) = 0. Since £ is a basis, 
F(L) = To. 


(a) The coefficient of t” comes from the first term of that equality in 


Exercise 5.1.21(a). So it’s (-1)”. 


(b) A polynomial of degree n has at most n zeroes. 


25. Where is the Corollaries? 
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26. By Exercise 5.1.20 and Exercise 5.1.21(b) and Theorem 5.3(b), we know 
the characteristic polynomial must be 


t? — tr(A)t + det(A). 


And the coefficient could be 0 or 1. So there are 4 possible characteristic 
polynomials. By checking that all of them are achievable, we get the 
answer are 4. 


5.2 Diagonalizability 


1. (a) 


(g) 


(h) 


No. The matrix 


1 0 0 
0 0 0 
0 0 0 


has only two distinct eigenvalues but it’s diagonalizable. 


No. The vectors (;) and (;) are both the eigenvectors of the matrix 


(5 : corresponding the same eigenvalue 1. 


) No. The zero vector is not. 


) Yes. If a € Ey, n Ey,, we have \\x2 = Ax = Ax. It’s possible only 


when x = 0. 


Yes. By the hypothesis, we know A is diagonalizable. Say A = 
PDP for some invertible matrix P and some diagonal matrix D. 
Thus we know that 


Q*AQ = (PQ) D(PQ). 
No. It need one more condition that the characteristic polynomial 


spilts. For example, the matrix (; ) has no real eigenvalue. 


Yes. Since it’s a diagonalizable operator on nonzero vector space, it’s 
characteristic polynomial spilts with degree greater than or equal to 
1. So it has at least one zero. 


Yes. Because we have 
Wu »y Wy, = {0} 
1#k 


and 
» Wi. = {0} =) W; 


i#k 


for all 7 #7, we get the desired answer. 
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(i) No. For example, take W, = span{(1,0)}, W2 = span{(0,1)}, and 
W3 = span{(1, 1)}. 


2. For these question, see the direction of the subsection “Test for Diagonal- 
ization”. 
(a) It’s not diagonalizable since dim( £1) is 1 but not 2. 


(b) It’s diagonalizable with D = i i) and Q = & i) 


(c) It’s diagonalizable with D = Gi *) and @ = fe 


3.0 =O 1 0 2 
(d) It’s diagonalizable with D=|0 3 OJfandQ=]1 0 4). 
0 0 -1 0 1 3 
(e) It’s not diagonalizable since its characteristic polynomial does not 
split. 
(f) It’s not diagonalizable since dim( £1) is 1 but not 2. 


4 0 0 1 1 O 
(g) It’s diagonalizable with D=|0 2 OJandQ=]2 O 1 J. 
0 0 2 -1 -1 -l 
3. For these question, we may choose arbitrary matrix representation, usually 
use the standard basis, and do the same as what we did in the previous 


exercises. So here we’ll have [T]g = D and the set of column vectors of Q 
is the ordered basis 6. 


(a) It’s not diagonalizable since dim( Eo) is 1 but not 4. 
-1 0 0 1 1 0 
(b) It’s diagonalizable with D=]| 0 1 OJandQ=]0 O 1}. 
0 0 1 -1 1 0 


(c) It’s not diagonalizable since its characteristic polynomial does not 
split. 


1 0 0 1 1 1 
(d) It’s diagonalizable with D=]0 2 OJandQ=]1 1 -1lj}. 
0 0 0 -1 0 


0 
i . dee deg. i my ee ee 
(e) It’s diagonalizable with D=( 0 a) and a= (4 i} 
-1 0 0 O 0 1 0 0 
ait be ret on 8 IO aes 0 it 0, 4:0 
(f) It’s diagonalizable with D = 0 01.0 and @ = Sa vgs Or 
0 0 0 1 0 0 0 1 


4. It’s not funny again. Replace the character T’ by the character A in that 
prove. 
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5. It’s not funny again and again. Replace the character T by the character 
A in that prove. 


6. (a) An operator T is diagonalizable ensure that its characteristic poly- 
nomial splits by Theorem 5.6. And in this situation Theorem 5.9(a) 
ensure that the multiplicity of each eigenvalue meets the dimension 
of the corresponding eigenspace. Conversly, if the characteristic poly- 
nomial splits and the multiplicity meets the dimension, then the op- 
erator will be diagonalizable by Theorem 5.9(a). 


(b) Replace T by A again. 


7. Diagonalize the matrix A by Q-'AQ = D with D = (; 3) and Q = 


(; 2) So we know that 


n np 5” 0 = 
A =QD a4 le 


8. We know that dim(£),) 21. So pick a nonzero vector v € Ey,. Also pick a 
basis 6 for Fy,. Then a = GU {v} forms a basis consisting of eigenvectors. 
It’s a basis because the cardinality is n and the help of Theorem 5.8. 


9. (a) Because the characteristic polynomial of T is independent of the 
choice of 6, we know that the characteristic polynomial 


f(t) = det([T],- tl) = I ((7 I) -1) 


splits, where the second equality holds since it’s a upper triangular 
matrix. 


(b) The characteristic polynomial of a matrix is also the same for all 
matrices which is similar to the original matrix. 


10. This is because the equality in Exercise 5.2.9(a). That is, if [T], is an 
upper triangular matrix, it’s diagonal entries are the set of zeroes of the 
characteristic polynomial. 


11. (a) Since eigenvalues are the zeroes of the characteristic polynomial, we 
may write the characteristic polynomial f(t) as 


(Ar — t) (Ag — ty? (Ag - t)™*. 


Calculate the coefficient of t”~! and use the fact in Exercise 5.1.21(b). 
Thus we could get the desired conclusion. 


(b) Use the equality in the previous exercise and calculate the coefficient 
of the constant term. Compare it to the Exercise 5.1.20. 
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12. 


13. 


14. 


(a) Let E, be the eigenspace of T corresponding to A and E)-1 be the 


(b 


(a) For matrix A = 


) 


wa 


NK 


wa 


wa 


eigenspace of T~! corresponding to \~'. We want to prove the two 
spaces are the same. If v ¢ Ey, we have T(v) = Av and so v = AT™!(v). 
This means T-!(v) = A7!v and v € Ey-1. Conversely, if v € Ey-1, we 
have T~!(v) = A7!v and so v = \"!T(v). This means T(v) = Av and 
VE Ey. 

By the result of the previous exercise, if T’ is diagonalizable and 
invertible, the basis consisting of eigenvectors of T’ will also be the 
basis consisting of eigenvectors of T7. 


; : , corresponding to the same eigenvalue 0 we 
have Eo = span{(0, 1)} is the eigenspace for A while Ep = span{(1,-1)} 
is the eigenspace for A’. 


Observe that 
dim(F)) = null(A - AJ) = null((A- X)*) 
= null( A’ — AZ) = dim(E4). 
If A is diagonalizable, then its characteristic polynomial splits and 
the multiplicity meets the dimension of the corresponding eigenspace. 
Since A and A?’ has the same characteristic polynomial, the charac- 
teristic polynomial of A‘ also splits. And by the precious exercise we 


know that the multiplicity meets the dimension of the corresponding 
eigenspace in the case of A’. 


Let v = (a, y) and v' = (2’,y’) and A= e | We may write the 


system of equation as Av = uv’. We may also diagonalize the matrix 
A by Q7!AQ = D with D = be ;) and Q = fe i This means 
D(Q™!v) = (Qu! v)’. So we know that 


and 
6 ce72t 
UV nm 
C2 e2t ) 
where c; is some scalar for all 7. 


3. «(0 2 od 
Calculate D = (; 5) and @ = & a) So we have 


where c; is some scalar for all 7. 
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1 0 O 1 1 
(c) Calculate D=|0 1 O] and Q=]0 1]. So we have 
0 0 2 0 1 


where c; is some scalar for all 7. 


15. Following the step of the previous exercise, we may pick a matrix Q whose 
column vectors consist of eigenvectors and Q is invertible. Let D be the 
diagonal matrix Q~'AQ. And we also know that finally we’ll have the 
solution « = Qu for some vector u whose i-th entry is cje* if the i-th 
column of Q is an eigenvector corresponding to ’. By denoting D to be 
the diagonal matrix with D,; = e?, we may write x = QDy. where the 
i-th entry of y is c;. So the solution must be of the form described in the 
exercise. 


For the second statement, we should know first that the set 
fore gre eo} 


are linearly independent in the space of real functions. Since Q invertible, 
we know that the solution set 


{QDy: ye R"} 
is an n-dimensional real vector space. 


16. Directly calculate that 


(CY); = (= Cin Yay)’ 
k=l 


Me 


Cai =Or'. 


17. (a) We may pick one basis a@ such that both [T], and [U]q are diagonal. 
Let Q = [J]%. And we will find out that 
[T]a = Q"[T]sQ 
and 
[U]a = Q''[U]sQ. 
(b) Let Q be the invertible matrix who makes A and B simultaneously 


diagonalizable. Say (@ be the basis consisting of the column vectors 
of Q. And let a be the standard basis. Now we know that 


[T]e= WET alf]¢ = Q"*AQ 


and 
[U]s = [118 [Val t]g = Q-* BQ. 


18. (a) Let 8 be the basis makes T and U simultaneously diagonalizable. We 
know that each pair of diagonal matrices commute. So we have 


And this means T and U commute. 


(b) Let Q be the invertible matrix who makes A and B simultaneously 
diagonalizable. Thus we have 


(Q-'AQ)(Q™" BQ) = (Q7'BQ)(Q* AQ). 
And this means that A and B commute since Q is invertible. 
19. They have the same eigenvectors by Exercise 5.1.15. 
20. Since we can check that 
W@W. 0 W3 = (W, @ W2) © Ws, 


if V is direct sum of W,, W2,...Wz, the dimension of V would be the sum 
of the dimension of each W; by using Exercise 1.6.29(a) inductively. 


Conversely, we first prove that if we have 
k 
> Wi = V, 
i=l 

then we must have 


i=l 
We induct on k. For k = 2, we may use the formula in Exercise 1.6.29(a). 
Suppose it holds for k =m. We know that 


m-1 m 
(>. Wi) + Wn = > Wi 
i=1 t=1 


and 
dim(3> Wi) < dim(S" W;) + dim(W,,) < ss dim(W;) 


i=1 i=l i=1 
by induction hypothesis and the case for k = 2. 


To prove the original statement, suppoe, by contradiction, that 
Win i= 2'W; 
has nonzero element. By the formula in Exercise 1.6.29(a) we know that 
k 
dim(5*i = 2"W;) > dim(V) - dim(W,) = )* dim(W;). 
i=2 


This is impossible, so we get the desired result. 
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21. 


22. 


23. 


Because ( is a basis, we have 


k 

» span({;) = V. 

i=l 
Second, since the dimension of span({;) is the number of element of {;, 
we know that the equality in the previous exercise about dimension holds. 
So V is the direct sum of them. 


By the definition of the left hand side, it is the sum of each eigenspaces. 
Let W = ee E),. If there is some nonzero vector v; in Ey, 1W. We may 
write 

Uy + CQVQ + C303 +++ + CKUR = 0 


for some scalar c; and some eigenvectors v; € Ey,. Now we know that 
0 = T(O) = Aquz + corAQVe + CZAZU3 +77 + CRARUE = 0. 
After substracting this equality by \, times the previous equality, we get 
€2(Ag — A1) v2 0 + OKA — A1)ug = 0. 


This is impossible since \;— A, is nonzero for all 7 and c; cannot be all zero. 
Similarly we know that #,, has no common element other than zero with 
the summation of other eigenspaces. So the left hand side is the direct 
sum of each eigenspaces. 


It’s enough to check whether Ky has common nonzero element with the 
summation of others or not. Thus we can do similarly for the other cases. 
Let V; be the summation of Ka, K3,..., Kp. Now, if there is some nonzero 
vector « € Ky(Vi 9 W2), we may assume that « = u+v with ue Vi and 
v € Wo. Since W, is the direct sum of all K;’s, we know Ki nV, = {0}. 
So «-u=v#0 is an element in both W,; and Wa, a contradiction. Thus 
we’ve completed the proof. 


5.3. Matrix Limits and Markov Chains 


1. 


(a) Yes. This is the result of Theorem 5.12. 
(b) Yes. This is the result of Theorem 5.13. 


(c) No. It still need the condition that each entry is nonnegative. 


(d) No. It’s the sum of each column. The matrix (; i} is a counterex- 
ample. 

(e) Yes. See the Corollary after Theorem 5.15, although there’s no proof. 

(f) Yes. See the Corollary 1 after Theorem 5.16. 

(g) Yes. Look Theorem 5.17. 


123 


(h) No. The matrix bi 


1 i has eigenvector (1,-1) corresponding the 


eigenvalue —1. 


(i) No. The matrix A = (’ has the property that A? = I. So the 
sequence would be A,J,A,J,..., which does not converge. 


(j) Yes. This is Theorem 5.20. 


2. Diagonalize those matrices and use the fact of the Corollary after Theorem 
5.12. Actually the eigenvalue will not tend to zero if and only if the 
eigenvalue is 1. 

(a) The limit is the zero matrix. 


=05 2 


(b) The limit is ie 15 


tT 7 
(c) The limit is é i). 
13 13 
(d) The limit is the zero matrix. 


(e) The limit does not exist. 


fe afd, Sl 
(f) The limit is i; 3) 


-1 0 -l 
(g) The limit is}-4 1 -2]. 
2 0 2 


6 9 38 


(i) The limit does not exist. 


2) eB sa 
(h) The limit is 0 0 0 J. 


(j) The limit does not exist. 
3. We know that 


lim (Am)i; = lim (Am) ji = Lyi = Li, 


4. If A is diagonalizable, we may say that Q-!AQ = D for some invertible 
matrix Q and for some diagonal matrix D whose diagonal entries are eigen- 
values. So lim. D™ exist only when all its eigenvalues are numbers in 
S, which was defined in the paragraphs before Theorem 5.13. If all the 
eigenvalues are 1, then the limit L would be [,,. If some eigenvalue  # 1, 
then its absolute value must be less than 1 and the limit of \” would 
shrink to zero. This means that D has rank less than n. 
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5. First we see three special matrices, 


(oo )a-G )e-G a) 


The limit of power of these three matrices is themselves respectly, because 
they has the property that their square are themselves. However, we know 
that 1 1 
—P.—P=R. 
V2 V2 
So we pick A = aP, B= 59: and C = R. Thus we have the limit of 
power of A and B are the zero matrix. And the limit of power of C is C 
itself. 


6. Let x, y, z, and w denote the percentage of the healthy, ambulatory, 
bedridden, and dead patients. And we know that 


x 0 
y|_ 10.3 
z| 10.7 
w 0 


and the percentage of each type would be described by the vector 


1 06 01 0\/0)\ (0.25 
0 0.2 02 O}f03] | 02 


0 0.2 0.5 Of] 0.7] [0.41 
0 O O02 1/\0 0.14 


in the same order. So this answer the first question. For the second 
question we should calculate the limit of power of that transition matrix, 
say A. It would be 


L= lim A™ = 


moo 


1 
0 
0 


eS SOs 


0 


OlKkK CO Cola 
olk CO Col 


So the limit of percentage of each type would be described by the vector 


59 
“lo71=|0 
31 

0 90 


7. Using the language of the stochastic matrix, we direct write the stochastic 
matrix to be 


14+ 0 0 
0 0 % 0 
= 3 
A=l0 2 0 0 
00 2 1 
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I don’t think there is too much difference between the process of diag- 
onalizing and finding the eigenvectors. It’s much easier to observe the 
sequence 

€2, Aeo, A’ep, eee 


>) 


which is 
oy 7ty hy sty stb 
3 3 27 af 
1 0 é 0 a 
2]> A o| 4 |> s ’ 
On ie 0 = 0 
3 27 
of Ny Va) Vy 4 
9 9 81 


And find the limit of the first entry to be 


hy De De De De Ds De Die al: 


ee ee ee ee ae 
3.39 399 3 ‘9 9 


So the answer is a 


8. There’s no better method to check whether the matrix is regular or not. 
So just try it or find the evidence that the power of it will not be a positive 
matrix. When the matrix is nonnegative, we may just consider the matrix 
obtained by replacing each nonzero entry by 1. 

(a) Yes. The square of it is positive. 
(b) Yes. It’s positive when the power is 4. 


. The second and the third column always interchange each time. 


. The second and the third columns do not change. 
No. The first column does not change. 


No. The third and the fourth columns do not change. 


) 
) 
(c) No 
(d) No. The second column does not change each time. 
) No 
) 
) 
) 


No. The third and the fourth columns do not change. 


9. Use the same method as that in Exercise 5.3.2. Or we may use the result 
of 5.20 for the case of regular matrix. 


—. , $2 3 3 

(a) The limitis]3 3 3 
303 ~=«3 

2 “Ose £2, 

(b) The limit is rT A 7q 


(c) The limit does not exist since the second and the third columns 
interchange each time. 
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0 0 0 
(d) The limit is} 1 1 1). 
0 0 0 
0 0 0 
(e) The limit is 2 1 O}. 
20° A 
1) i> «0 
(f) The limit is}0 2 2]. 
ie 
5 5 
0 0 0 0 
(g) The limit is ‘ fi : al 
ia 
0 0 0 0 
(h) The limit is : : ae 
i 4 


10. To calculate the vector after two stage, we could only multiply the matrix 
twice and multiply the vectors. To calculate the limit matrix we could 
just find the eigenvector corresponding to eigenvalue 1 and use Theorem 
5.20. 


(a) The two-stage vectors and the fixed vectors are 


0.225\ (0.2 
0.441 | ,| 0.6 
0.334) \0.2 


(b) The two-stage vectors and the fixed vectors are 


0.375\ (0.4 
0.375 |, | 0.4 
0.25} \0.2 


(c) The two-stage vectors and the fixed vectors are 


0.372\ 0.5 
0.225 | ,} 0.2 
0.403) \0.3 


respectly. 


respectly. 


respectly. 
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(d) The two-stage vectors and the fixed vectors are 


0.252\ (0.25 
0.334 | , | 0.35 
0.414 0.4 


(e) The two-stage vectors and the fixed vectors are 


0.329\ /2 
0.334], t 
0.337 i 


(f) The two-stage vectors and the fixed vectors are 
0.316\ (0.25 
0.428 | ,} 0.5 
0.256) \0.25 


0.7 0.2 0 
A=10.1 0.6 0.2]. 


respectly. 
respectly. 


respectly. 


11. The matrix would be 


0.2 0.2 0.8 
So the vector in 1950 would be 


0.1 0.197 
A] 0.5 | = | 0.339 |. 
0.4 0.464 


Since it’s regular, we may just find the fixed vector 


0.2 
0.3]. 
0.5 


12. Considering the three states, new, once used, and twice used, the matrix 


1S 
1 
O]. 
0 


It is regular. So we can just find the fixed vector 


ane 

Il 
— 
Owlnw|r 
win Owl 


13. 


14. 


15. 


Considering the three states, large, intermediate-sized, and small car own- 
ers, the matrix is 
0.7 O01 O 
0.3 0.7 0.1 
0 02 0.9 
and the initial vector is 
0.4 
P=]0.2]. 
0.4 


So the vector in 1995 would be 


0.24 
A’?P =] 0.34]. 
0.42 


And the matrix A is regular. So we may just find the fixed vector 


0.1 
0.3 
0.6 


We prove it by induction on m. When m = 1, the formula meet the matrix 
A. Suppose the formula holds for the case m =k-1. Then the case m=k 
would be 


Tk-1 Tk Tk 
AP=A®1A=| rp rea re JA 
Tk Tk Tk-1 
Th Tea1 Tk TE-1V+tTK 
a Tk-1tTk Th Tk-1tTlk 
2 
CRATE Tk-1tTk 
2 2 Tk 
After check that 
(-1)* (-1)* 
Tk-1 +1k 1 1+ QR-2 +1+ QR-1 1 (-1)* 
eRe =5[1+ k ]= ret 
2 3 2 2 


we get the desired result. To deduce the second equality, just replace A” 
by the right hand side in the formula. 


Let v be that nonnegative vector and say d is the sum of all its entries. 
Thus we have x = tv is a probability vector. Furthermore, if y is a 
probability vector in W. They must be parallel, say y = kx. The sum of 
entries of y is k by the fact that x is a probability vector. And the sum of 
entries of y is 1 since itself is a probability vector. This means k = 1. So 
the vector is unique. 
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16. 


17. 


18. 


19. 


If A is a (not necessarily square) matrix with row vectors Aj, Ao,...,A,, 
we have the fact 

Atu = At + Ab +--+ Ab. 
This vector equal to u if and only if the sum of entries is 1 for every column 
of A. Use this fact we’ve done the proof of Theorem 5.15. 


For the Corollary after it, if M is a transition matrix, we have 
(M*)'u = (M*)*u = (M*t)P lus. =u. 
And if v is probability vector, 
(Mv)'u=v'M'u=v'u= (1). 
By Theorem 5.15 we get the conclusion. 


For the first Corollary, we may apply Theorem 5.18 to the matrix A’ since 
we have the fact that A and A’ have the same characteristic polynomial. 
Thus we know that A = v(A). Also, the dimension of eigenspace of A’ 
corresponding to \ is 1. But Exercise 5.2.13 tell us that A‘ and A has the 
same dimension of the corresponding eigenspaces. 


For the second Corollary, we know that v(A) = 1. So if \ + 1 then we 
have |A| < 1 by Theorem 5.18 and its first Corollary. And the eigenspace 
corresponding 1 has dimension one by the first Corollary. 


By Theorem 5.19, all eigenvalues lies in S$, which was defined in the para- 
graphs before Theorem 5.13. Thus the limit exist by Theorem 5.14. 


(a) First we check that 
(cM + (1-c¢)N)’u=cM'u+ (1—c)N'u =cut (1-c)u= u. 


Thus the new matrix is a transition matrix by the Corollary after 
Theorem 5.15. So it’s enough to show that the new matrix is regular. 
Now suppose that M* is positive. Then we know that (cM + (1 - 
c)N)* is the sum of c*M* and some lower order terms, which are 
nonnegative. So we know that it’s a positive matrix and so the new 
matrix is regular. 


(b) Pick a scalar d such that each entry of dM’ is larger than that of M. 
Then we may pick c= + and 


1 
N = ——(M'-cM) 
l-c 


and know that N is nonnegative. Finally we may check that 


1 1 
N'u= —(M"u-cM'u) = -(l-c)u=u. 
L>€ l-e 


So N is also a transition matrix by the Corollary after Theorem 5.15. 
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(c) By symmetry, it’s enough to prove only the one side of that state- 
ment. Also, by (b) we could write 


M'=cM+(1-c)N 


for some scalar c and some transition matrix N. Now, if M is regular, 
then M’ is also regular by (a). 


20. Use the notation of the Definition, when A = O we can observe that B,, 
is equal to I for all m. So e? would be J. For the case A = J, we have 
A™ =T for all m. So the matrix e/ = ef. That is, a matrix with all its 


entries e. 
21. We set 

2 A™ 
By =I+ A+ 4++++— 
2! m! 

and 3 
Bm = 1+ D+ +04 a 
! m! 


We may observe that B,, = PE,,P™' for all m. So by the definition of 
exponential of matrix and Theorem 5.12, we have 


e4 = lim Bm = lim (PEmP~') = P( lim Em)P™ = Pe? Pl. 


m-—-> co 


22. With the help of previous exercise, it’s enough to show that e? exist if 
P-'AP =D. Since we know that 


Cae Day”, 
we have that ,with the notation the same as the previous exercise, 


..\2 ..)m 
(Bm)ia= 1+ Dut PH 5. Pi” 


mi ? 


which tends to e?* as m tends to infinity. So we know that e? exists. 


23. The equality is usually not true, although it’s true when A and B are 


diagonal matrices. For example, we may pick A = (; i and B= li i) 


By some calculation we have 


and 
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and 


However, we may also calculate that 


eitB _ e? a ' 
0 1 


24. Use the notation in the solution to Exercise 5.2.15. And in the proof we 
know that the solution x(t) = QDy for some y ¢ R”. Now we know that 
actually D = e? by definition. With the fact that Q is invertible, we may 
write the set of solution to be 


{Qe?Q lu: veR"} = {e4u: ve R"} 


by Exercise 5.3.21. 


5.4 Invariant Subspace and the Cayley-Hamilton 
Theorem 


1. (a) No. The subspace {0} must be a T-invariant subspace. 
(b) Yes. This is Theorem 5.21. 


(c) No. For example, let T be the identity map from R to R and v = (1) 

and w = (2). Then we have W = W’‘=R. 

(d) No. For example, let T be the mapping from R? to R? defined by 
T(x,y) =y. Pick v = (1,1). Thus the T-cyclic subspace generated 
by v and T(v) are R? and the z-axis. 

e) Yes. The characteristic polynomial is the described polynomial. 


f 
& 


( 
(f) Yes. We may prove it by induction or just use Exercise 5.1.21. 


) 
) 
(g) Yes. This is Theorem 5.25. 

(a) Yes. For every ax? + ba +ce W, we have 


T(ax? + ba +c) = 2ar+be W. 


(b) Yes. For every ax? + bx +ce W, we have 


T (ax? + bx +c) = 2ax” + br € W. 


(c) Yes. For every (t,t,t) « W, we have 


T(t, t,t) = (3t, 3t, 3t) « W. 
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(d) Yes. For every at + be W, we have 


T(at +b) = (5 +d)teW. 
(e) No. For (; H) e W, we have 


ray=() pew. 


3. (a) We have T(0) = 0 and T(v) € V any ve V and for arbitrary linear 
operator T’ on V. 


(b) If v € N(T), we have T(v) = 0 € N(T). If v € R(T), we have 
T(v) € R(T) by the definition of R(T). 


(c) If ve Ey, we have T(v) = Av. Since T(Av) = AT(v) = A*v, we know 
that T(v) € By. 


4. We have 
T'(W)c Te (T(W)) oT (W) cc W. 


For any v € W, we know that g(T)(v) is the sum of some elements in W. 
Since W is a subspace, we know that g(T)(v) is always an element is W. 


5. Let {Wi}ier be the collection of T-invatirant subspaces and W be the 
intersection of them. For every v € W, we have T(v) € W; for every i € J, 


since v is an element is each W;. This means Tv) is also an element in 
WwW. 


6. Follow the prove of Theorem 5.22. And we know that the dimension is 
the maximum number & such that 


eC) Te bgt 3 Wrenn! dua 3 
is independent and the set is a basis of the subspace. 
(a) Calculate that 
z= (1,0,0,0),7T(z) = (1,0,1,1), 
T(z) = (1,-1,2, 2), T°(z) = (0, -3, 3, 3). 
So we know the dimension is 3 and the set 
{z, T(z), T’(z)} 


is a basis. 
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(b) Calculate that 
z=2°,T(z) = 62,T?(z) = 0. 


So we know that the dimension is 2 and the set 
{z,T(z)} 


is a basis. 
(c) Calculate that T(z) = z. So we know that the dimension is 1 and {z} 


is a basis. 
0 1 1 1 
e=(‘ 0) Te (3 Ar 


(d) Calculate that 
icc: SB 
T(z)= (; :) : 


So we know that the dimension is 2 and the set 
{z,T(z)} 
is a basis. 


7. Let W be a T-invariant subspace and Ty be the restricted operator on 
W. We have that 


R(Tw) =Tw(W) =T(W) cw. 
So at least it’s a well-defined mapping. And we also have 
Tw (x) + Tw(y) = T(@) + T(y) = Te +y) = Tw(e ty) 


and 


Tw (cx) = T (cx) = cT (x) = cTw(z2). 


So the restriction of T on W is also a lineaer operator. 


8. If v is an eigenvector of Tw corresponding eigenvalue A, this means that 
T(v) = Tw(v) = Av. So the same is true for T. 


9. See Example 5.4.6. 


(a) For the first method, we may calculate T?(z) = (3,-3,3,3) and rep- 
resent it as a linear combination of the basis 


T?(z) = Oz - 3T(z) + 8T?(z). 


So the characteristic polynomial is -¢® + 3t? — 3t. For the second 
method, denote 6 to be the ordered basis 


{z, T(z), T?(z),T?(z)}. 
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And we may calculate the matrix representation 


00 0 
[Twle=|1 0 -3 
(ae eee 


and directly find the characteristic polynomial of it to get the same 
result. 


(b) For the first method, we may calculate T?(z) = 0 and represent it as 
a linear combination of the basis 


T(z) = 0z+0T(z). 


So the characteristic polynomial is t?. For the second method, denote 
B to be the ordered basis 


{z,T(z)}. 


And we may calculate the matrix representation 


and directly find the characteristic polynomial of it to get the same 
result. 


(c) For the first method, we may calculate T(z) = z. So the characteristic 
polynomial is -t+1. For the second method, denote { to be the 
ordered basis {z}. And we may calculate the matrix representation 


[Tw] = (1) 


and directly find the characteristic polynomial of it to get the same 
result. 


(d) For the first method, we may calculate T?(z) = 3T(z). So the char- 
acteristic polynomial is ¢? — 3t. For the second method, denote 3 to 
be the ordered basis 


{z,T(z)}. 


And we may calculate the matrix representation 


and directly find the characteristic polynomial of it to get the same 
result. 


10. Calculate the characteristic polynomial is the problem of the first sec- 
tion and determine whether on polynomial is divided by the other is the 
problem in senior high. 
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(a) The characteristic polynomial is 
t4 — 4t? + 6t? — 3t. 
(b) The characteristic polynomial is ¢*. 
(c) The characteristic polynomial is 
t4 —2t? + 2-1. 
(d) The characteristic polynomial is 
t* — 6t? + 9t?. 


11. (a) Let w be an element in W. We may express w to be 


k * 
we al): 
i=0 
And thus we have 
k . 
T(w) = > a;T***(v) € W. 
i=0 


(b) Let U be a T-invariant subspace of V containing v. Since it’s T- 
invariant, we know that T(v) is an element in U. Inductively, we 
know that T*(v) ¢ U for all nonnegative integer k. By Theorem 1.5 
we know that U must contain W. 


12. Because W is a T-invariant subspace, we know that T(v;) ¢ W for vu; € +. 
This means the representation of T(v;) corresponding the basis 8 only use 
the vectors in y for each vu; € y. So one corner of the matrix representation 
would be zero. 


13. If w is an element in W, it’s a linear combination of 
{v,T(v),T°?(v),.... 
So w = g(T)(v) for some polynomial g. Conversely, if w = g(T)(v) for 


some polynomial g, this means w is a linear combination of the same set. 
Hence w is an element in W. 


14. It’s because that the set 
DO) TAB, ace 0) 


is a basis of W by Theorem 5.22, where k is the dimension of W. 
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15. 


16. 


17. 


18. 


The question in the Warning is because the definition of f(A) is not 
det(A- AI). To prove the version for matrix, we may apply the theorem 
to the linear transformation L.4. So we know that the characteristic poly- 
nomial of L.4 is the same as that of A, say f(t). And we get the result 
f(La) =To by the theorem. This means 


f(A)u = f(La)(v) = 0 
for all v. So we know f(A) =O. 


(a) By Theorem 5.21 we know that the characteristic polynomial of the 
restriction of T to any T-invariant subspace is a factor of a polynomial 
who splits. So it splits, too. 


(b) Any nontrivial T-invariant subspace has dimension not equal to 0. 
So the characteristic polynomial of its restriction has degree greater 
than or equal to 1. So it must contains at least one zero. This means 
the subspace at least contains one eigenvector. 


If we have the characteristic polynomial to be 
f(t) = (-1)"#" + an_it” | +--+ a0, 
then we have 
f(A) = (-1)"A" + an. A") +--+ ag =O 


by Cayley-Hamilton Theorem. This means that A” is a linear combination 
of I, A, A?,..., A+. By multiplying both sides by A, we know that A”*+ 


is a linear combination of A, A?,..., A". Since A” can be represented as 
a linear combination of previous terms, we know that A”*! could also be 
a linear combination of J, A, A?,...,A"~+. Inductively, we know that 


span{I, A, A”,...} =span{I, A, A’,..., A" +} 
and so the dimension could not be greater than n. 


(a) See Exercise 5.1.20. 


(b) For simplicity, denote the right hand side to be B. Directly calculate 
that 


AB (1a a eA hae al A) 


so, 1 
aes 
1 
-—(-aol) = I. 
ag 
(c) Calculate that the characteristic polynomial of A is 


-t3 +207 +¢-2. 
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So by the previous result, we know that 


Al= pica? + 2Ael 


#27 20 

il. 3 

=|9 35 § 
GO “0: <4. 


19. As Hint, we induct on k. For k = 1, the matrix is (-ao) and the char- 
acteristic polynomial is -(ag + t). If the hypothesis holds for the case 
k = m-1, we may the expand the matrix by the first row and calculate 
the characteristic polynomial to be 


-t 0 —do 

det(A-tr)=det| > — 

0 0 -—Am-1 
Sf A 0y eh ae 1 t- 0 
=-tdet]} . oe i ae + (-1)'™*!(—ag) det a Bi 
Gi. Os Ae Sa ba a 


= -t[(-1)" (aq + agt + + amt 2 + 4™1)] + (-1)™ag 


= (-1)™ (ag + apt + + + Git 1 +2). 
20. If U = g(T), we know that UT = TU since 
TL") = (Pr = prt 


and T is linear. For the converse, we may suppose that V is generated by 
v. Then the set 

B ={v,T(v),T?(v),...,T*(v)} 
is a basis. So the vector U(v) could be written as a linear combination of 
8. This means U(v) = g(T)(v) for some polynomial g. Now if UT = TU, 
we want to show U = g(T) by showing U(w) = g(T)(w) for all w € £. 
Observe that 


U(T™ (v)) = T™(U(v)) = Tg T)(v) = (T)T™(v)) 
for all nonnegative integer m. So we get the desired result. 


21. If we have some vector v € V such that v and T(v) are not parallel, we 
know the T-cyclic subspace generated by v has dimension 2. This means 
V is a T-cyclic subspace of itself generated by v. Otherwise for every 
uv €V, we may find some scalar \, such that T(v) = Ayv. Now, if Ay is a 
same value c for every nonzero vector v, then we have T = cl. If not, we 
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22. 


23. 


24. 


may find \, # A, for some nonzero vector v and u. This means v and u 
lies in different eigenspace and so the set {u,v} is independent. Thus they 
forms a basis. Now let w=u+u. We have both 


T(w) = Aww = Aw + AwU 


and 
T(w) = Av + Ayu. 


By the uniqueness of representation of linear combination, we must have 
Av = Aw = Au; 
a contradiction. So in this case we must have T = cl. 


If T + cI, then V must be a T-cyclic subspace of itself by Exercise 5.4.21. 
So by Exercise 5.4.20, we get the desired result. 


As Hint, we prove it by induction on k. For k = 1, it’s a natural statement 
that if v; isin W then v; is in W. If the statement is true for k = m-1, 
consider the case for k = m. If we have u = v1 + vg +--+ Um is in W, a 
T-invariant subspace, then we have 


T(u) = Az, v1 + Agqve +++ + AmUm € W, 


where 4; is the distinct eigenvalues. But we also have \,,u is in W. So 
the vector 


T(u) = AmU = (Ai Zn Am)U1 + (\2 = Am)v2 tee t (Am-1 = Am)Um-1 


is al so in W. Since those eigenvalues are distinct, we have A; — Ay, is not 
zero for alli +m. So we may apply the hypothesis to 


1 = Am)U15 (2 - Am) v2 a Cer ~ Am)Um-1 


and get the result that (Aj-Am)v; is in W and so v; is in W for all i #_m. 
Finally, we still have 


Um = U- V1 — VQ - ++ — Um-1 € W. 


Let T be a operator on V and W be a nontrivial T-invariant subspace. Also 
let Ey, be the eigenspace of V corresponding to the eigenvalue ’. We set 
W) = E, AW to be the eigenspace of Ty corresponding to the eigenvalue 
». We may find a basis 8), for W) and try to show that 6 = U)8) isa 
basis for W. The set @ is linearly independent by Theorem 5.8. Since T’ 
is diagonalizable, every vector could be written as a linear combination of 
eigenvectors corresponding to distinct eigenvalues, so are those vectors in 
W. But by the previous exercise we know that those eigenvectors used to 
give a linear combination to elements in W must also in W. This means 
every elements in W is a linear combination of 6. So 6 is a basis for W 
consisting of eigenvectors. 


139 


25. (a) Let Ey be a eigenspace of T corresponding to the eigenvalue \. For 
every uv € Ey we have 


TU(wv) = UT(v) = AU (v). 


This means that EF) is an U-invariant subspace. Applying the pre- 
vious exercise to each Ey, we may find a basis 6) for Ey such that 
[Ur, ]a, is diagonal. Take 6 to be the union of each 8) and then 
both [T], and [U], are diagonal simultaneously. 


(b) Let A and B are two nx n matrices. If AB = BA, then A and B 
are simultaneously diagonalizable. To prove this we may apply the 
version of linear transformation to L4 and Lp. 


26. Let {v1, v2,...,Un be those eigenvectors corresponding to n distinct eigen- 
values. Pick 
V= Uz +t VQ +0 + Un 


and say W to be the T-cyclic subspace generated by v. By Exercise 5.4.23 
we know that W must contains all v;’s. But this means the dimension of 
W is n and the set 

{v,T(v),... A des Co 


is a basis by Theorem 5.22. 


27. (a) Ifu+W =v'+W, we know that v—v’ is an element in W by Exercise 
1.3.31. Since W is T-invariant, we have 


T(v) -T(v') =T(v-v’) 
is in W. So we have 
T(v)+W=T(v')+W 


and this means - - 
T(v+W)=T(v' +W). 


(b) Just check that 
T((vt+v')+W)=T(vtv')+W 
=(T(v)+W)+(T(v')+W) =T(v+W)+T(v' +W) 
and 
T(cu+ W) =T(cv) + W =c(T(v) + W) = cT(v+ W). 
(c) For each v € V we might see 


nT (v) =T(v)+W =T(v+W) =Tn(v). 
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28. 


29. 


30. 


We use the notation given in Hint. Since W is T-invariant, we know the 
matrix representation of T' is 


As the proof of Theorem 5.21, we know that f(t) = det([T],- tl) and 
g(t) = det(B; - tI). It’s enough to show h(t) = det(B3 - t) by showing 
Bs is a matrix representation of T. Let 


a = {up + W, vp + W,..-,Un + W} 


be a basis for V/W by Exercise 1.6.35. Then for each 7 = k,k+1,...,n, 
we have _ 
P(vj) = T(vj) +W 


n 


k 
= ie (B2)ijv + y (B3)ijvil +W 
l=1 i=k4+1 


n 


= SS (B3)ijvi+ W = > (B3)ij(ui + W). 


t=k+1 t=k+1 
So we have B3 = [T] and 
f(t) = det([T]g - tI) = det(B, - tL) det(Bs - tL) = g(t)h(t). 

We use the notation given in Hint of the previous exercise again. By 
Exercise 5.4.24 we may find a basis y for W such that [Ty], is diagonal. 
For each eigenvalue 4, we may pick the corresponding eigenvectors in 
and extend it to be a basis for the corresponding eigenspace. By collecting 
all these basis, we may form a basis 8 for V who is union of these basis. 


So we know that [T']g is diagonal. Hence the matrix B3 is also diagonal. 
Since we’ve proven that B3 = [T]., we know that T is diagonalizable. 


We also use the notation given in Hint of the previous exercise. By Exercise 
5.4.24 we may find a basis 


Y= {U1,02,---,UKE} 
for W such that [Tw], is diagonal and find a basis 
= {Une + W, ps2 + W,...,0n + W} 


for V/W such that [T], is diagonal. For each v + W ¢€ V/W, we know 
that v+ W is an eigenvector in V/W. So we may assume that 


T(v)+W=T(v+W)=d\0+W 
for some scalar \. So this means that T(v)—Av is an element in W. Write 


T(v) = Au + a1 U1 + GgQUg +++ + AkUE, 
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3l. 


where v,’s are those elements in y corresponding to eigenvalues \;’s. Pick 


ai 
c; to be ae and set 


vl =UF C101 + Cova to + CKUR- 


Those c,’s are well-defined because Tw and T has no common eigenvalue. 
Thus we have 


T(v') = T(v) + eT (v1) + oT (v2) ++ + cK T (vg) 
=)\v+ (a1 + c1A1) v1 # (a2 + c2A2)v2 abet 2 (az + CeAk)Uk) 
= )v' 


and v+W =v'+W. By doing this process, we may assume that v; is an 
eigenvector in V for each vj +W ¢ a. Finally we pick 


B = {v1,02,---,Un} 
and show that it’s a basis for V. We have that y is independent and 
5 = {Up41, Vet2,+++,Un} 


is also independent since 7(6) = a is independent, where 77 is the quotient 
mapping defined in Exercise 5.4.27. Finally we know that if 


Ops1Ver1 + Ap+2VK42 + °AnUn € W, 


then 
(Gg41UE41 + Ak+2VK42 +°AnUn) +W =04+W 


and so 


Ok+1 = Ak42 =" = An = 0. 


This means that span(d)m W = {0}. So V is a directed sum of W and 
span(d) and 6 is a basis by Exercise 1.6.33. And thus we find a basis 
consisting of eigenvectors. 


(a) Compute 


1 0 
T(e1) =| 2],77(e1) = | 12 
1 6 


and 
T?(e1) = —6e, + 6T(e1). 


This means that the characteristic polynomial of Ty is t? — 6t + 6. 
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(b) We know that 
dim(R?/W) = 3-dim(W) =1 


by Exercise 1.6.35. So every nonzero element in R?/W is a basis. 
Since e2 is not in W, we have e2 + W is a basis for R?/W. Now let 
B = {e2+W}. We may compute 


1 
T(eg+W)=|3]+W=-e.+W 
2 


and [T], = (1). So the characteristic polynomial of T is -t - 1. 


(c) Use the result in Exercise 5.4.28, we know the characteristic polyno- 
mial of T is 
-(t+1)(t? - 6t +2). 


32. As Hint, we induct on the dimension n of the space. For n = 1, every matrix 
representation is an uppertriangular matrix. Suppose the hypothesis is 
true for n = k—1. Now we consider the case n = k. Since the characteristic 
polynomial splits, 7 must has an eigenvector v corresponding to eigenvalue 
d. Let W = span({v}). By Exercise 1.6.35 we know that the dimension of 
V/W is equal to k-1. So we apply the induction hypothesis to T defined 
in Exercise 5.4.27. This means we may find a basis 


a = {ug + W,ug + W,...,un+W} 


for V/W such that [T], is an upper triangular matrix. Following the 
argument in Exercise 5.4.30, we know that 


B= {v, U2, U3,..., UR} 


is a basis for V. And we also know the matrix representation is 


which is an upper triangular matrix since [T’], is upper triangular. 


33. Let 
W=W1+ Wote+ We 


for some w; € W;. Thus we have T(w;) € W; since W; is T-invariant. So 
we also have 


T(w) = T(w1) +t T (we) ap kee T (we) €W, + Wo+---We. 


34. In the next exercise we prove this exercise and the next exercise together. 
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35. 


36. 


37. 


38. 


39. 


40. 


Let v is an element in 6. We know that T(v) is a linear combination of 
8. Since T(v) is in W1, we know that the combination only use elements 
in 6,. The same argument could be applied to each element in (@; for each 
i. So we know the matrix representation of T corresponding to the chosen 
basis ( is 

B, @ By @-:: ® Bx. 


First observe that a one-dimensional T-invariant subspace is equivalent 
to a subspace spanned by an eigenvector. Second, V is the direct sum of 
some one-dimensional subspaces if and only if the set obtained by choosing 
one nonzero vector from each subspace forms a basis. Combining these 
two observation we get the result. 


Use Theorem 5.25 and its notation. We have 
det(T’) = det(B) = det(B)) det( B2)---det( By) 
= det (Tw, ) det (Tw, Jer det (Tw, ) : 


By Exercise 5.4.24, we already get the necessity. For the sufficiency, we 
may pick bases 3; for W; such that [Ty,] is diagonal. Combining these 
bases to be a basis 6 = U%_, 6; for V. By using Theorem 5.25, we know 
that Tis diagonalizable. 


By Exercise 5.2.18, we already get the necessity. For the sufficiency, we 
use induction on the dimension n of the spaces. If n = 1, every operator on 
it is diagonalizable by the standard basis. By supposing the statement is 
true for n < k-1, consider the case n = k. First, if all the operator in C has 
only one eigenvalue, then we may pick any basis 3 and know that [T], is 
diagonal for all T ¢ C. Otherwise, there must be one operator T possessing 
two or more eigenvalues, 41, A2,...,Az. Let W; be the eigenspace of T 
corresponding to the eigenvalue 4;.. We know that V is the direct sum 
of all W;’s. By the same reason in the proof of Exercise 5.4.25, we know 
that W; is a U-invariant subspace for all U ¢ C. Thus we may apply the 
induction hypothesis on W; and 


C; = {Uw, :U €C}. 


Thus we get a basis 6; for W; such that [Uw,]g, is diagonal. Let 8 be the 
union of each 6;. By applying Theorem 5.25, we get the desired result. 


Observe that 
A-tI =(B,-tl) ®(B.-tlI) ®--@(B, - tI). 


So we have 


k 
det(A - tI) =|] det(B; - tZ). 
t=1 
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42. 


Let v; be the i-th row vector of A. We have {v1, v2} is linearly independent 
and v; = (¢-1)(v2—v1) +1. So the rank of A is 2. This means that t”~? 
is a factor of the characteristic polynomial of A. Finally, set 


8={(,1,...,1),(,2,...,n)} 


and check that W = span() is a L4-invariant subspace by computing 


1 1 1 

r } - (mat 1) sa) : 8 Z 

1 1 n 

and 
1 1 1 
ii 2 ay kiwee eae) Eo 1 m+ 2 
: 6 2 : 2 

n 1 n 


So we know the characteristic polynomial is 


n?(n+1) _t 


2 


n(n+1) 2_ 4 (ntl) (2n+1) n?(n+1) 
2 
n 


Corea ae 


= (-1)"1" 12¢? + (-6n3 - 6n)t- n° +3 
12 

But this is the formula for n > 2. It’s natural that when n = 1 the char- 

acteristic polynomial is 1-¢. Ur...I admit that I computed this strange 

answer by wxMaxima. 


Observe that the nullity of A is n- 1 and det(A-nJ) = 0 because the 
sum of all column vectors of A-nI is zero. So we know the characteristic 
polynomial of it is 

Gly" —n). 
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Chapter 6 


Inner Product Spaces 


6.1 Inner Products and Norms 


1. (a) Yes. It’s the definition. 
(b) Yes. See the paragraph at the beginning of this chapter. 
(c) No. It’s conjugate linear in the second component. For example, in 
C with standard inner product function we have (i,7) = 1 but not 
i(i,1) =-1. 
(d) No. We may define the inner product function f on R to be f(u,v) = 
2uuv. 
(e) No. Theorem 6.2 does not assume that the dimension should be 
finite. 
(f) No. We may define the conjugate-transpose of any matrix A to be 
the conjugate of A®. 
(g) No. Let x = (1,0), y= (0,1), and z = (0,-1). Then we have (x, y) = 
(x, z) =0. 
(h) Yes. This means ||y|| = (y, y) = 0. 


2. Compute 
(x,y) =2-(2+i)+(1+i)-24+i-(1- 2i) = 845i, 
| xl) = (w,2)2 = (44241)? = V7, 
lvl = (y.y)? = (54445)? = V4, 
and 


|x +y| = |(4-4,3 +4, 1 + 32) = (17+ 10 + 10)? = V37. 


We have the Cauchy-Schwarz inequality and the triangle inequality hold 
since 


|8 + 5i| = V89 < V7V'14 = V119 
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and 


V7 +V14 > V37. 


3. As definition, compute 


(f.ar= f° fbo(eat= f tae! =te'\h— f eldt=e-(e-1)=1, 


ay ae ee 
l= f= 5, 
t 
lol= f= 5(2-0, 


[f+ ol = le+ ell = (FI? + 2(f,9) + Ig l?)? 


and 


1 1 4 _ 18¢? 
= —+24+-—(e4- 2e7 +1) = calle ey 
9 4 36 
And the two inequalities hold since 
1 1 
ie ee =4 
<3 5 ie ) 
and i 5 
9e* — 18e Beene 24), 


€ 
36 3 2 
4. (a) We prove the formula 


(A, B) = tr(B* A) = >) (B* Ay = YY Bi Ag 
j=l j=li=l 
=<St * Ai Bij =yy Aj, Bi; = > Aig Bay- 
jali=l i=1j=1 ag 


So we may view the space Myxn(F) to be F”’ and the Frobenius 
inner product is corresponding to the standard inner product in F’ ne 


(b) Also use the formule to compute 
| Aj = (1454941)? =4, 


|B =(2+04+1+1)2 =2, 


and 
(A, B) = (1-4) +0-3i-1=-4i. 


5. We prove the condition for an inner product space one by one. 


(a+ z,y) = (a+ 2z)Ay* = 2AY* + zAy* = (x,y) + (z,y). 
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(cx, y) = (er) Ay" = c(wAy") = cx, y). 


(x,y) = (wAy")” = yA*a® = yAa® = (y, 2). 
One of the equality use the fact A= A”. 


(x, x) = (41,22) A(a1, £2)* = |laeq||? + ia FQ — ixgFq + 2\|29||? 
= |21||? + 2Re(ix1Fq) + 2\|-r2\|7)0 


if x1 or %2 is not 0. Here the function Re(z) means the real part of 
a complex number z. 


So it’s an inner product function. Also compute that 


(x,y) = 1(1-4)(2-4) + i(1-4)(3+ 2%) + (-i)(2 + 34) (2-1) + 2(2 + 31)(3 +22) 


= (1-2)(22) + (2+ 32)(5 + 27) = 6+ 21%. 


(a) 
(x,y + 2) = (y+ 2,2) = (y, x) + (2,2) = (x,y) + (x, z). 
(b) 
(x, cy) = (cy, x) = cly, x) = C(x, y). 
(c) E 
(x,0) = 0(x,0) =0 
and 


(0, x) = (a, 0) =0. 
(d) If « = 0, then (0,0) = 0 by previous rule. If « #0, then (a,x) > 0. 


(e) If (x,y) = (a, z) for all a ¢ V, we have (x, y- z) =0 for alla eV. So 
we have (y- z,y- z) =0 and hence y- z = 0. 


(a) 


Ica = (cx, cx)® = (cea, x))? = |ellja]. 
(b) This is the result of ||z|| = («,a)? and Theorem 6.1(d). 


(a) The inner product of a nonzero vector (1,1) and itself is 1-1 = 0. 
(b) Let A= B= Ig. We have 


(2A, B) =342(A, B) =4. 


(c) The inner product of a nonzero function f(x) = 1 and itself is 


1 
Ei Gide Lo: 
0 
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9. 


10. 


11. 


12: 


13. 


(a) Represent x to be a linear combination of ( as 


and so x = 0. 
(b) This means that (x - y, z) = 0 for all z € 8. So we have x - y =0 and 
D=y. 


Two vectors are orthogonal means that the inner product of them is 0. So 
we have 


|c+yl? =(a+y,0+y) 
= xl? + (x,y) + (yx) + [yl]? = fol? + +My. 


To deduce the Pythagorean Theorem in R?, we may begin by a right 
triangle ABC with the right angle B. Assume that « = AB and y= BC. 
Thus we know the length of two leg is ||| and |y||. Finally we know 
AC = «+~y and so the length of the hypotenuse is ||z + yl]. Apply the 
proven equality and get the desired result. 


Compute that 
jx + yl? + a - yl)? 


= (fell? + (x,y) + (yx) + Hyll?) + (al? - (2,9) - (y,2) + yl?) 
= Ql]? + 2[yl. 


This means that the sum of square of the four edges of a prallelogram is 
the sum of square of the two diagonals. 


Compute that 


k k 
| YS av; = ‘om a;V;) 
i=1 i=1 


k 
=¥ law |? + > (ai0;, 050) 
i=l 


ij 


Check those condition. 
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(v+z,y)=(rt+z,y)1 t+ (v+z,y)2 
= (x,y)1 + (z,y)1 + (x, y)2 “F (z,y)2 
= (x,y) + (z,y). 


(ca, y) = (cx, y)1 + (cx, y)2 
= (x, yi + (x, y)2 = e(x,y). 


(z,y) = (r,Y)1 + (x, y)2 
me (y,@)1 + (y,X)2 = (y, 2). 


(x, x) = (x,x)1 +(x, 2)2>0 
if x #0. 


14. Check that 


(A+ cB);; = (A+ cB) 5; = Aj; + CBs; = (A* HEB” \eae 


15. (a) If one of z or y is zero, then the equality holds naturally and we have 
y = Ox or « = Oy. So we may assume that y is not zero. Now if x = cy, 
we have 

a, y)l = Key, yl = lellly||? 
and 
Jz] - yl = levll Yall = lellial?- 
For the necessity, we just observe the proof of Theorem 6.2(c). If the 
equality holds, then we have 


|x - cyl] = 0, 
where 
ow EY) 
(yy) 
And so © = cy. 
(b) Also observe the proof of Theorem 6.2(d). The equality holds only 
when 


Re(x,y) = |x, y)| = [al - ly. 


The case y = 0 is easy. Assuming y # 0, we have x = cy for some 
scalar ce F’. And thus we have 


Re(c)|lyll? = Re(cy,y) = Key. yl = lel- lull? 
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and so 
Re(c) = |e]. 


This means c is a nonnegative real number. Conversely, if x = cy for 
some nonnegative real number, we may check 


z+ yl =le+ Illy = (e+ 1) Iyll 


and 
zl + Iyll = lellyll + Iyll = (e+ Dy. 


Finally, we may generalize it to the case of n vectors. That is, 
v1 +g +--+ ep] = [ail] + [eo +--+ lanl 


if and only if we may pick one vector from them and all other vectors 
are some multiple of that vector using nonnegative real number. 


16. (a) Check the condition one by one. 


(Frhg)= sf" (Fe) +h)g@Bat 


20 20 
= 5 [pat + 5 [ (ngat = (F.9) + (ha. 
1 20 
(cfg)=5- [ efgat 


=o f" Fat) = (F.9). 


1 27 20 


27 
-> f gfdt = (g, f). 


2n 
(f= sf LPat> 0 


if f is not zero. Ur... I think this is an exercise for the Adavanced 
Calculus course. 


(b) No. Let 
0 if a< 


i 
r= { 94 if >t 
Then we have that (f, f) =0 but f #0. 


17. If T(x) = 0 Then we have 
Ix] = 0] = 0. 


This means x = 0 and so T is injective. 
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18. If (-,-)’ is an inner product on V, then we have that T(x) = 0 implies 
(x, x)" =(T(x),T(x)) =0 


and « = 0. So T is injective. Conversely, if Tis injective, we may check 
those condition for inner product one by one. 


(w+ 2,y)' = (T(x + 2), T(y)) = (T(x) + T(z), Ty) 
= (T(x), T(y)) + (T(z), T(y)) = (ey) + (2,9). 
(cx, y)’ = (T (cx), T(y)) = (cT (x), T(y)) 
= (T(x),T(y)) = (2,9) 

(x, y)' = (T(a),T(y)) 
= (T(y),T(2)) = (y, ay’. 


(x, x)’ = (T(x),T(x)) >0 


if T(a) #0. And the condition T(x) # 0 is true when z #0 since T is 
injective. 


19. (a) Just compute 
Jct yl? = al? + (x,y) + (ya) + fy? 


= al? + (x,y) + (yx) + [yll? = lal? + Re((x,y)) + lal? 


and 
Jz — yll? = Jal? — (2,9) —(y, 2) + Wy IP 
= |x|? - (x,y) - (y,2) + |y? = a]? - Re((z,y)) + ly’. 
(b) We have 
lz-yll + lyl = [el 
and 


Ie yl + lel = ly- al + le] 2 Iyl- 
Combining these two we get the desired inequality. 


20. (a) By Exercise 6.1.19(a), we have the right hand side would be 


7 (4Re(2,y)) = (2.0). 


The last equality is due to the fact F = R. 
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(b) 


21. (a 


ae 


22. (a) 


Also use the Exercise 6.1.19(a). First observe that 
jx + i yl? = lal? + Relax, iy) + e*yll? 


= |x|? + 2Re[i*(x, y)] + |yl?. 
Assuming (x,y) to be a+ bi, we have the right hand side would be 


4 4 = 4 
10) i*) Jal? +2 0° *Re[i*(x,y)] + (> **) | y17] 
k=l k=1 k=1 


= 5 [bi + ( a)(-1) + (-b)(-7) + a] = a+ bi = (x,y). 


Observe that 
Lr a ery eer : 
A= (5(A+ A*))" = 54" +A) = At, 
* 1 *\\* 1 * * 
Ap = (5 (A A*))" = (-—(A* - A) = A. 
4 27 
and 


Ay +iAg= 5(At A") + (A- At) = (At A= A, 


I don’t think it’s reasonable because A, does not consists of the real 
part of all entries of A and A, does not consists of the imaginary 
part of all entries of A. But what’s the answer do you want? Would 
it be reasonable to ask such a strange question? 


If we have A = B, +iBy with BY = B, and B3 = Bo, then we have 
A* = BY -iBz. Thus we have 


By = 5(A+ A*) 


and i 
By =—(A- A’). 
21 


As definition, we may find v1, v2,...,Un € 8 for x,y,z € V such that 


n n n 
r= yy avi, y = > divi, 2 = >: d;V;.- 
i=l i=l i=l 


And check those condition one by one. 


(a + z,y) = >» (a; + ai)uu, bee) 


= 3 (a; + di )b; = SG: +f Sab 
i=1 t=1 i=1 


7 (x,y) + (2,y)- 
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23. 


(cx,y) = >» (aii, bie) 


=) (cao: =e). a:b; = clay). 
i=l i=l 


e 
(x,y) = > abe 
a=1 
= >> aid; = >) biG = (y, x) 
i=l i=1 
e 


(a,x) = > |a,|? >0 
i=l 


if all a,’s is not zero. That is, x is not zero. 
(b) If the described condition holds, for each vector 


n 
v= >, ALU; 
i=1 


we have actually a; is the i-th entry of x. So the function is actually 
the atandard inner product. Note that this exercise give us an idea 
that different basis will give a different inner product. 


(a) We have the fact that with the standard inner product (-,-) we have 
(x,y) =y*x. So we have 


(x, Ay) = (Ay)"a = y"A*a = (A*a, 9). 
(b) First we have that 
(A"x,y) = (x, Ay) = (Bx, y) 
for all x,y¢ V. By Theorem 6.1(e) we have A*x = Bx for all x. But 
this means that these two matrix is the same. 
(c) Let 6 = {v1,v2,...,Un}. So the column vectors of Q are those v;’s. 
Finally observe that (Q*Q);; is 
Pe eae ee ea 
OE MDa) -| 0 if i#}. 
So we have Q*Q =I and Q* = Q'I. 
(d) Let @ be the standard basis for F”. Thus we have [T], = A and 


[U], = A*. Also we have that actually [J]8 is the matrix Q defined 
in the previous exercise. So we know that 


(U]e = [118 [U]a[I]3 = QAQ™' = QAQ* 
= (QA*Q")* = [18 [T a2] = [T]e. 
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24. Check the three conditions one by one. 


(a) 
|| Al] = max|Aj;;| > 0, 
UJ 


and the value equals to zero if and only if all entries of A are 


Zero. 
e 
aA] = max |(aA) 55] 5 max lal] Ais| 
= lal max| Ais] = lal]. 
e 
| A+ Bl = max|(A + B)ij| = max|Ai; + Bay| 
tJ tJ 
< max|4ij|+max| Bil = LA] + [BL 
(b) 
= t)|>0 
Is meee, 
and the value equals to zero if and only if all value of f in [0,1] 
is zero. 
e 
= = t 
Jaf = max |(af)(6)|= max lal, f(0) 
= t)} = ; 
jal ax |F(¢)| = lal 
e 


If + gl = tee (f+ a) (I= ie lf(t) + g(t)| 


< t Aye 
pees If ( ) ee lg) = Nfl + Tall 


1 
lvl= fo iF@lae> 0, 


and the value equals to zero if and only if f =0. This fact depend 
on the continuity and it would be an exercise in the Advanced 
Calculus coures. 


lasl= [ larcolar= f lallrcolat 
=lal {LCI lal 
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iftal= f r@+ocides [Fl +loolat 
=f is@lat+ fo lo(elae= Fl + bol. 


(a, b) || = max¢|a], ||} 2 0, 


and the value equals to zero if and only if both a and b are zero. 


I|c(a, 6) || = max{|cal, |cb]} = max {|cllal, |cllo|} 
= |e] max{|a], ||} = |ell|(@, ) |. 


| (a,b) + (¢,d)|| = max{|a + c|,|b + dl} < max{|al + |e], [| + |d]} 
< max{|a], |b/} + max{|c|, |d|} = ||(a, 6)|| + (c,@) |. 


25. By Exercise 6.1.20 we know that if there is an inner product such that 
\a|? = (a, a) for all a € R?, then we have 


1 1 
(v,y) = Zhe + ul? - Fhe. 
Let x = (2,0) and y = (1,3). Thus we have 


(0,9) = 71,3)? - Z1G,-3)9? = 0 


and 


(22,4) = 71(5,3)1? - 5103.8) ? = 25-9) = 4. 


This means this function is not linear in the first component. 


26. (a) 
d(x, y) = |v — yl 2 0. 
(b) 
d(x, y) = |x - yl = lly - al] = dy, 2). 
(c) 


d(x,y) = |e-yl = (@-2)+(2-y)Il s a-z[+ [2-yl = da, y) +d(z, y). 


(4) 
d(x, x) = lle al = 0] = 0. 
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(e) 
U(x, y) = |v -yl| > 0 


if x-y is not zero. That is, the distance is not zero if x + y. 


27. The third and the fourth condition of inner product naturally hold since 


(x,y) = slle+ yl’ - Iz - yl] = (v2) 


and 


(x, x) = “Ul221? | 07] = ||)? > 0 
if c #0. Now we prove the first two condition as Hints. 
(a) Consider that 
Jz + Qy|? + al]? = 2I)x + yll? + ly? 
and 
jx — 2y|? + xl]? = 2l)x - yll? + ly? 


by the parallelogram law. By Substracting these two equalities we 


have 
Ix + 2y|? - jw - Qyl|? = 2a + yl]? - 


iw) 


Je y?. 


And so we have 


(0, 2y) = Fle + Qu)? - [a - 2y 7] 
1 2 a 
= 5 [2le + yl? - 2h —yl?] = 2¢2,9). 
(b) By the previous argument, we direct prove that 
(a + u, 2y) = 2(a, y) + 2(u, y). 
Similarly begin by 
z+ ut Qyl? + Ja —ul? = 2a + yl]? + 2a + yl? 


and 
a+ w— Qyl|? + lla - ull? = 2I)a — yll? + Qu — yl? 


by the parallelograom law. By substracting these two equalities we 
have 


c+ u+ yl? - jot u-2Qy|]? = Zle+y|?-2lx-yl]?+2luty|?-2lu-yl?. 
And so we have 


1 
(x + u,2y) = F[e +u+ 2yl? - |x + uy] 


al 
1 2 2 2 2 
= 7a + all - 2a —yl" + Ql + yl - 2fu- yl] 


= 2(x,y) + 2(u,y). 
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(c) Since n is a positive integer, we have 
(na,y) = ((n—1)a,y) + (x,y) 


= ((n- 2)a,y) + 2(x,y) = + = n(x, y) 
by the previous argument inductively. 


(d) Since m is a positive integer, we have 


(x,y) = (m(—<2),y) = (2,9) 


by the previous argument. 


(e) Let r = ® for some positive integers p and q if r is positive. In this 
case we have 


(r,y) = (p(=2),y) = p(a2,y) 
q q 


=F (x,y) =r(x,y). 
q 


If r is zero, then it’s natural that 


(0,4) = sll? lll] = 0 = 0(2,y). 


Now if r is negative, then we also have 


(ra, y) = ((-r)(-), y) = -r(-2, 9). 


But we also have 
(-2,y) = —(x,y) 
since 
(=2,9) + (x,y) 
1 
= lied yl? -|-2-yl? + |x + yl? - |e—yl?] = 0. 
So we know that even when r is negative we have 


(ray) = —r(—a,y) = r(x, y). 


(f) Now we have the distribution law. Also, the position of the two com- 
ponent can be interchanged without change the value of the defined 
function. Finally we observe that 


1 
[lla + x] - Jol?) = [al]? 


(x, x) = 4 


for all x € V. So now we have 


lc]? + 2,4) + ly? = Ie + yl? 
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< (Jel* yl)? = lel? + lel - Ly + hell? 


by the triangle inequality. Similarly we also have 
Jj]? — 2a, y) + |yll? = |e — yl? 
< (a* | - yll)? = Jal? + 2] yl + byl?- 
Thus we get the desired result 
(x, y)] < lal - yl. 


(g) Since 
(c-r)(x,y) = (x,y) — r(x, y) 


and 


((c (eu) ~ (cx Tx, y) = (cx, y) r(z,Yy), 
the first equality holds. And by the previous argument we have 
-le-rllzllyl s (e-r)(x,y), ((e-r)a,y) < le-r|la] [yl 


and so we get the final inequality. 


(h) For every real number c, we could find a rational number such that 
lc-r| is small enougl{"| So by the previous argument, we have 


(cx, y) = (x,y) 
for all real number c. 
28. Check the conditions one by one. 


[x + z,y] 7 Re(x + z,Y) = Re((z, y) + (z,y)) 
= Re(z, y) + Re(z,y) = [vy] + [z,y]. 


[cx, y] = Re(cx, y) 
=cRe(x,y) = c[z,y], 


where c is a real number. 


[x,y] = Re(x,y) a Re(x, y) 
= Re(y, x) = [y, x]. 
[x, x] = Re(a,x) = (a,x) >0 
if x #0. 


1This is also an exercise for the Adavanced Calculus course. 
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Finally, we have [x, ia] = 0 since 
(x, iz) a —i{z, x) 
is a pure imaginary number. 


29. Observe that 


O=[a t+ ty,i(a + ty)] = [at ty, ia —- y] = [tv, ty] -[2,y]. 


So we have an instant property 


[x,y] = [éa, iy]. 
Now check the conditions one by one. 


e 
(ce+z,y)=[r+z,y]+2[x +z, iy] 


= [x,y] + [z,y] + i[x, iy] i i[z, ty] = (x,y) + (z,y). 


((a+ bi)z, y) = [(at bi)z, y] + i[(a + bi)z, iy] 
= [azx,y] + [biz, y] + 2[az, iy] + i[biz, iy] 
= a([x,y] + ix, iy]) + bi([ix, ty] - ix, y]) 
= a(x,y) + bi([a, y] + i[a,ty]) = (a + bt) (a, y). 
Here we use the proven property. 


(x,y) = [x,y] - i[z, iy] 
=[y, a] + ily, ta] = (x,y). 
Here we use the proven property again. 


(x, x) =[2,2]+i[x,ir] =[2,x]>0 


if x is not zero. 


30. First we may observe that the condition for norm on real vector space is 
loosen than that on complex vector space. So naturally the function | - || 
is still a norm when we regard V as a vector space over R. By Exercise 
6.1.27, we’ve already defined a real inner product [-,-] on it since the 


parallelogram law also holds on it. And we also have 


: 1 ? " 
[x, ix] = yl + ex” — |a - éa||?] = 
if vo 12 : 2 if 2 ; yy 2 
= qlle + eal” - Ia) (w+ tx)" ] = Fille + tall" —|- alll(x + tx)" ] = 0. 


So by Exercise 6.1.29 we get the desired conclusion. 
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6.2 The Gram-Schmidt Orthogonalization Pro- 
cess and Orthogonal Complements 


1. (a) No. It should be at least an independent set. 
(b) Yes. See Theorem 6.5. 


(c) Yes. Let W be a subspace. If x and y are elements in W* and cisa 
scalar, we have 


(x+y, w) = (x, w) + (y,w) = 0 
and 
(cx, w) = c(x,w) =0 
for all w in W. Furthermore, we also have (0, w) = 0 for all we W. 
(d) No. The basis should be orthonormal. 
(e) Yes. See the definition of it. 
(f) No. The set {0} is orthogonal but not independent. 
) Yes. See the Corollary 2 after Theorem 6.3. 


2. The answers here might be different due to the different order of vectors 
chosen to be orthogonalized. 


(a) Let 
S = {w, = (1,0,1), we = (0,1, 1), w3 = (1,3,3)}. 


Pick v, = w;. Then construct 


And then construct 
(ws, v1) (ws, U2) 5 
1 
vi? |v2\|? 
4 4 a 1 + 
w v v fay ; 
ae ee ae 


As the demand in the exercise, we normalize v1, v2, and v3 to be 


U3 = W3 


1 1 
Oe aa 5 

1 4 1 
U2 = ( Te ie 6: 
ee eee 1 
a aaa” 


), 
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Let 8 = {u1,u2,u3}. Now we have two ways to compute the Fourier 
coefficients of x relative to 8. One is to solve the system of equations 


ayUy, + A2QU2 + AZU3 = XL 


and get 


x ug + Ous. 


es Stang SO 
ae: 
The other is to calculate the i-th Fourier coefficient 


ay = (x, us) 


directly by Theorem 6.5. And the two consequences meet. 
Ur...don’t follow the original order. Pick w1 = (0,0,1), we = (0,1,1), 
and w3 = (1,1,1) and get the answer 


6 = {(0,0,1), (0, 1,0), (1,0, 0)} 


instantly. And easily we also know that the Fourier coefficients of x 
relative to 6 are 1,0, 1. 


(c) The basis is 


— 
og 
wa 


B = {1, V3(2e - 1), V5(6a? - 6x + 1)}. 


And the Fourier coefficients are 3, wa, 0. 
(d) The basis is 
p2{ 040) = aay 
4 >= st, i t,1—1, 40 é 
V2 2/17 


And the Fourier coefficients are ue V17i. 
(e) The basis is 
2 1 24 4 2 3 1 
5? 5? BBS V30' V30' 30° 30 
3 4 9 7 
( V155' 155’ 155" Visa 
And the Fourier coefficients are 10,330, 150. 
(f) The basis is 
1 2 1 3 2 2 1 1 
VIS 4/15. 3/15" Jia Fie’ V10' 10’ 10 
4 2 1 3 


), 


B={( )s 


Tan’ V0’ a0" Va0” 
i : ee ee 
And the Fourier coefficients are =e ia ae 
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(g) The basis is 


And the Fourier coefficients are 24, 6/2, ~9\/2. 
(h) The basis is 


2 de 5 2 8 a8 
o-(@ 8). ¥).(28 “48h 
JV1I3— (V13 (cums V373 V373 
And the Fourier coefficients are 5/13, -14, 373. 
(i) The basis is 


V2sin(t) V2cos(t) m-4sin(t) 8cos(t) + 2nt- 1? 


B ~ { ’ ’ ’ \, 
Va VT V1 — 81 = 397 
= 4 
, : Q(Qr+2 34 92_gq7-g 2548-16 
And the Fourier coefficients are Y2@2"+2) _ 4/2 x°+n*-an-8 em : 
FE I RPI SMEAR 5 EE BD 


(j) The basis is 
i 3 89 - ashe g 1. s¢e4 
{( E 3 aaa ) ) ’ 
22° 22° 25 V5 V5 V5 
fal i+3 5 5 )} 
2/35" \/35° 2\/35' 2/35 


And the Fourier coefficients are 6\/2, 4/5, 2/35. 
(k) The basis is 


), 


( 4 3-2 3 Ln 4i) (3-4 Bi 2i-1 i+2 
S47’ S47’ S47 VAT 2/15” 2/15? V5? 2/15 
in 17 Bi-9 Bi-9 81-9), 

2\/290’ 2\/290’ \/290’ 2,\/290°" 


And the Fourier coefficients are —/47i- 47, 4V/15i-2V/15, 2/2901 + 
2/290. 


(1) The basis is 


1-i =3i-2 3V2i = -i-1 —43i-2 1-211 
{ 2/10 2/10 5 5/2 5/323 5/323 } 
27+2 it+4 oT 1-37 741 ’ 68% 34% : 


2/10 2/10 5V2 5BV2 57323 5/323 


3 


( 


And the Fourier coefficients are 2\/10 — 6\/10#, 10\/2, 0. 
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(m) The basis is 


i-l1 i di -9i-11 -118i-5 —26i-7 
3/2 3/2 246 246 V39063 39063 

{ 2-4 Bit] Jol 524+] 1-1 ) 1457 58 ie 
2 V246 246 V39063 V39063 


And the Fourier coefficients are 3.\/2i + 6/2, —\/246i — \/246, 0. 


. Check that 6 is an orthonormal basis. So we have the coefficients of (3, 4) 


are 
1 1 7 


ee 
1 is 1 
On Cee 3 


. We may find the null space for the following system of equations 


(3,4) -( 


and 


(3,4) -( 


(a,b,c): (1,0,7) =a-ci=0, 
(a,b,c)-(1,2,1) =a+2b+c=0. 
So the solution set is S+ = span{(i,-}(1 +7), 1)}. 


. We may thought zo as a direction, and thus S@ is the plane orthogonal to 
it. Also, we may though the span of x1, x2 is a plane, and thus Sp is the 
line orthogonal to the plane. 


. Take X to be the space generated by W and {x}. Thus X is also a finite- 
dimensional subspace. Apply Theorem 6.6 to x in X. We know that x 
could be uniquely written as u+v with ue W and ve V+. Since x ¢ W, 
we have v #0. Pick y=v. And we have 


(x,y) = (v,v) + (u,v) = Jul? > 0. 


. The necessity comes from the definition of orthogonal complement, since 
every element in @ is an element in W. For the sufficiency, assume that 
(z,v) = 0 for all vu € 8. Since @ is a basis, every element in W could be 
written as 


k 
Yar, 
i=l 
where a; is some scalar and v; is element in G. So we have 
k k 
(z, ys a;V;) = > a5 (z,v;) =0. 
i=l i=l 


Hence z is an element in Wt. 


164 


10. 


11. 


12. 


. We apply induction on n. When n= 1, the Gram-Schmidt process always 


preserve the first vector. Suppose the statement holds for n < k. Consider 
the a orthogonal set of nonzero vectors 


{wi, wa, diay Wk}. 


By induction hypothesis, we know that the vectors v; = w; for 7 = 1,2,...,k- 
1, where v; is the vector derived from the process. Now we apply the pro- 
cess the find 


k-1 (wn v;) 
Une a De 
i=1 u 


= Wn -O0= Wn. 


So we get the desired result. 


. The orthonormal basis for W is the set consisting of the normalized vector 


(i,0, 1), {li, 0,1)}. To find a basis for W* is to find a basis for the null 
space of the following system of equations 


(a,b,c) - (i,0,1) = -ai+c=0. 


The basis would be {(1,0,2),(0,1,0)}. It’s lucky that it’s orthogonal. If 
it’s not, we should apply the Gram-Schmidt process to it. Now we get the 


orthonormal basis 1 


Wp 


by normalizing those elements in it. 


(1, 0,7), (0, 1,0)} 


By Theorem 6.6, we know that V = W@W* since Wn W+* = {0} by 
definition. So there’s a nature projection T on W along W~*. That is, we 
know tht every element x in V could be writen as u+v such that ue W 
and v € Wt and we define T(x) = u. Naturally, the null space N(T) is 
W+. And since u and v is always orthogonal, we have 


Jal? = Jul? + Jol? > ul? = ITC)? 
by Exercise 6.1.10. And so we have 
[Z7(@)| < let. 
Use the fact 
(AA")ig = (vi, 5) 
for all i and j, where vu, is the i-th row vector of A. 


If x € (R(Ly+))*, this means that x is orthogonal to A + y for all ye F”. 
So we have 
0 = (x, Avy) = (Aax,y) 
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for all y and hence Ax = 0 and a € N(La). Conversely, if  ¢ N(La), we 
have Ax = 0. And 
(x, A*y) = (Az,y) = 0 


for all y. So a is a element in (R(Ly4:))*. 


13. (a) If a € $+ then we have that x is orthogonal to all elements of S, so 
are all elements of 59. Hence we have x € 55. 


(b) If « € S, we have that x is orthogonal to all elements of S*+. This 
means « is also an element in (S+)+. And 


span(S) c (S*)* 


is because span(S) is the smallest subspace containing S and every 
orthogonal complement is a subspace. 


(c) By the previous argument, we already have that W c (W*)*. For the 
converse, if « ¢ W, we may find y ¢ W* and (z,y) #0. This means 
that W 3 (W*)?*. 


(d) By Theorem 6.6, we know that W=W+W+. And ifaeWnW?, 
we have 
(x, 2) = ||2|? = 0. 


Combine these two and get the desired conclusion. 

14. We prove the first equality first. If ¢ ¢ (W1+W2)*, we have x is orthogonal 
to ut+v for all ue Wy and v € W2. This means that x is orthogonal to 
u=u+0 for all wu and so z is an element in Wj. Similarly, x is also an 
element in W;. So we have 

(W, + W.)* Cc Wy. 
Conversely, if « ¢ Wi m Wz, then we have 
(xu, u) = (x,v) =0 
for all we W, and ve Wa. This means 
(a,u+v) = (x,u) + (x,v) =0 


for all element u+v¢eW ,+We,. And so 


(W, + W.2)* a Wi. 


For the second equality, we have ,by Exercise 6.2.13(c), 
(Win W2)* = (Wr)? 9 (We) 


= ((Wy + W3)")" = Wy + We. 
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15. (a) By Theorem 6.5, we have 


Thus we have 


(x,y) a » ({x, vi) Uz, (y, vj )v;) 


tJ 


= (2s) (ur 0a)e) 


= Sia 1"%(x,vi)y, vi). 


(b) The right hand side of the previous equality is the definition of the 
standard inner product of F”. 


16. (a) Let W =span(S). If u is an element in W, who is finite-dimensional, 
then by Exercise 6.2.15(a) we know that 


n 
Jul? = do tu, vP?. 
a=1 


Now for a fixed x, we know that W’ = span(W vu {2}) is finite- 
dimensional. Applying Exercise 6.2.10, we have T(z) « W and 
|Z'(x)|| < |||. This means 


|x|? = |T(@)I? = Dre). wP 


by our discussion above. Ultimately, by the definition of T, we have 
x =T(a)+y for some y who is orthogonal to all the elements in W. 
Thus we have 


(x, vi) = (T(x), vi) + (y, vi) = (T(2), vi). 
So the inequality holds. 
(b) We’ve explained it. 


17. First, since (T(x),y) = 0 for all y, we have T(x) = 0. Applying this 
argument to all 2 we get T(x) = 0 for all x. For the second version, by 
Exercise 6.1.9 we know that T(x) = 0 for all x in some basis for V. And 
this means T’= To by Theorem 2.6. 


18. Let f be an odd function. Then for every even funcion g we have fg is an 
odd function since 


fa(t) = f(-t)g(-t) = -F@)g9@). 


So the inner product of f and g is zero. This means Wz 2 Wo. 
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Conversely, for every function h, we could write h = f + g, where 


FO) = 5(H) +h) 


and i 
g(t) = 5(h(é) - R(-t)). 


If now h is an element in W;, we have 


O= (hf) = (FF) +49 F) = FIP 


since f is a even function. This means that f = 0 and h = g, an element 
in W,. 


19. Find an orthonormal basis for W and use the formula in Theorem 6.6. 


(a) Pick {Fe (1,4)} as a basis for W. Thus the orthogonal projection 
of u is 


1 1 26 
(u, ha pee = 7764): 


Vv V17 
(b) Pick {Fa(-3, 1,0), (2, 1,0)} as a basis for W. Thus the orthogo- 


nal projection of u is 

1 1 
—3,1,0) + (, —=(1,3,5)) = 
( ea ( Mas 


1 1 
(u, Fig ae Te 


Tig 7 (1, 3,5) 


1 4 1 
= -~(-3,1 -(1 = — (29,17, 40). 
5(-3,1,0) + 3(1,3,5) = = (29, 17,40) 


(c) Pick {1, oq (2a- 1)} as a basis for W. Thus the orthogonal projection 


of h is 1 1 
h,1)1 + (h, 24-1 22-1 
(h, 1)1 + ( va Lar. ) 
29 #1 1 
=—+ 24-1) =—(4+1). 
Bg eget 
20. If v is the orthogonal projection of u, then the distance of u is the length 
of wu-v. 


(a) The distance is 


i(2.6)- 0.4] 2a. 


(b) The distance is 
|(2, 1,3) - (29, 17,40) = — 
at) 14 ’ ) TA 
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21. 


22. 


23. 


(c) The distance is 


1 [30527 
~277 4+ 3xr+4)-= 1) =, /——. 
\(-22° + 3a + 4) get I 1215 


Do this with the same method as that in Exercise 6.2.19. Let {u1, w2, us} 
be the orthonormal basis given by Example 5 of this section. Then the 
closest second-degree polynomial approximation is the orthogonal projec- 
tion 

(e",u1)uy + (e°, uz) ue + (e*, uz)ug 

1 

= al Gse- 105e7*)t? + 12¢ - 3e7 + 33). 

(a) Use the Gram-Schmidt process to find a orthogonal basis {t, - St-5Vt} 
and normalize it to be 


{/3t, -V2(6t — 5V‘t)}. 
(b) Do it as that we’ve done in Exercise 6.2.19. The approximation is 


=: 20\V/t — 45t 
28 


—— 
@ 
wa 


Let x(n), y(n), z(n) be sequences in the condition of inner product 
space. Since all of them has entry not zero in only finite number of 
terms, we may find an integer N such that 


t(n) = y(n) = 2(n) 
for alln > N. But this means that all of them are vectors in F™. So 
it’s an inner product. 
(b) It’s orthogonal since 


co 


(e1,€)) = >; ei(n)e;(n) 


n=1 


= e;(i)e;(z) + ex()e;(J) = 0. 
And it’s orthonormal since 


co 


(ei,e:) = > ei(n)ei(n) = €;(4)e:(4) = 1. 


n= 


(c) i. Ife; is an element in W, we may write 
€1 = A101 + A202 +++ AOR, 


where a; is some scalar. But we may observe that a; must be zero 
otherwise the i-th entry of e; is nonzero, which is impossible. So 
this means that e; = 0. It’s also impossible. Hence e; cannot be 
an element in W. 
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ii. If a is a sequence in W*, we have a(1) = —a(n) for all i since 
(a,0n) = a(1) + a(n) =0 


for all 7. This means that if a contains one nonzero entry, then 
all entries of a are nonzero. This is impossible by our definition 
of the space V. Hence the only element in W~ is zero. 

On the other hand, we have W* = {0}* = V. But by the previous 
argument, we know that W # V = (W?*)?. 


6.3. The Adjoint of a Linear Operator 


1. (a) Yes. See Theorem 6.9. 


(b) No. It just for the linear mapping V > F. For example, the value of 
the identity mapping from R? to R? is not an element in F. 

(c) No. The equality holds only for the case 8 is an orthonormal basis. 
For example, let A = ; ) and T = Ly. Thus we have T” = Las. 


But for the basis 


B= 1 1), (0, 1)}, 


we have 


(d) Yes.See Theorem 6.9. 

(e) No. Choose a =i, b=0 and T = Ic = U. We have (aT)* = aT” # aT”. 
(f) 

(g) Yes. See Theorem 6.11. 


Yes. See Theorem 6.10 and its Corollary. 


2. Follow the prove of Theorem 6.8. The vector would be 
y=) g(vi)vi- 
i=1 


(a) The vector is (1, -2,4). 
(b) The vector is (1,2). 
(c) The vector is 210a? - 204 + 33. 


3. Use the definition and that skill used in the previous exercises. 


(a) By definition we have 
((a, b), T*(x)) = ((2a + b, a — 3d), (3,5)) = lla - 120. 


We may observe that T*(a) = (11,-12). 
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(b) By definition we have 
(21, 22),T*(«)) = (224 + 129, (1 = i)z1), (3 —4, 1+ 2i)) 
= (5 = i)z1 + (-1 + 3i)z2. 
We may observe that T*(a) = (5+ i,-1- 32). 
(c) By definition we have 


(at +b, T*(x)) = (at + (a+ 3b), 4 — 2t) 


i. —Gat? + (-2a + 6b)t + (4a + 12b)dt 
We may observe that T* (a) = 6t + 12. 
4. (b) Compute 
(x, (eT) "(y)) = (eT (2), 9) = 
(T(x), ¢y) = (a, T"(cy)) = (2, eT" (y)) 
for all x and y. 
(c) Compute 
(x, (TU)"(y)) = (TU(«),y) 
= (U(x), T"(y)) = (2, U"T"(y)) 
for all x and y. 


(e) Compute 
(x, I"(y)) = (1(@), 9) 
= (x,y) = (x, I(y)) 
for all x and y. 
5. (a) Just write it down just as that in the proof of (c). 
(a) Compute 
Lc A+B) = (Lise = (La + Lp)" 
=(La)* + (Lp)* = Lax t+ Lee = Laren. 

(b) Compute 

L(ea)* = (Lea)” = (eLa)* 

= c(La)” = CL a* = Laat. 


(d) Compute 
Lye Eay = La. 


(e) Compute 
Lear ety 
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(b) The statement for nonsquare matrices has no difference with the 
Corollary but the statement (e) cannot holds since there’s no non- 
square identity matrix. To this result come from that A* is the 
conjugate of A®. 


6. Compute 
oe ey ha ay Ge en Gr NA 
and 
Ue AT aT AT 20, 
7. Let A= E i Then N(A) # N(A*) since (0,1) is an element only in 


the later one. 


8. If T is invertible, then we have the inverse mapping T~!. Then we have 
a Ea eA Raa) Se a 6 


9. For each vector v € V we may write v = vj + vg such that v; « W and 
vo € W+. Now check 


(x1 + 22,T" (yi + y2)) = (T(21 + £2), y1 + Y2) 


= (1,41 + Yo) = (21,91) 


and 
(x1 + %2,T (yi + y2)) = (#1 + 2,41) 


= (1,41) = (v1 + 22,T" (yi + y2)) 
for all x =21 +22 and y=y1 + yo. 


10. The sufficiency is easy since we may just pick y = x. For the necessity, 
suppose now |T'()|| = ||a||. By Exercise 6.1.20 we have 


4 4 
=F LIT +E YP = 5D FIT@ +ETOP 
k=1 k=1 
(T(x),T(y)) 


if F = C. However, for the case F = R the above argument could also work 
if we just pick k& to be 2 and 4. 
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11. We have 


12. 


0 =(T*T (2), x) = (T(x), T(z)) = |T(#)? 


for all x and hence T(x) = 0 for all x. And the second statement is also 
true since we may write 


A Base halal ant 


and get T* = To. Since T = T** = Tp = To. 


(a) If  ¢ R(T*)* we have 


wa 


0 = (a,T*(y)) = (T(2),y) 


for all y. This means that T(x) = 0 and so x¢ N(T). Conversely, if 
x € N(T), we have 


(x, T"(y)) = (T(z), y) = 0 
for all y. This means that x is an element in R(T*)+. 


By Exercise 6.2.13(c) we have 
N(T) =(R(T")")* = R(T"). 
If ce N(T*T) we have T*T(x) = 0 and 
0=(T°T (a), a) = (T(x), T(@)). 
This means that T(x) = 0 and x € N(T). Conversely, if « « N(T), 


we have T*T(x) = T*(0) =0 and so xe N(T*T). 
On the other hand, since the dimension is finite, we have 


R(T*T) = N(T*T)* = N(T)* = R(T) 
by the previous exercise. Hence we have 
rank(T*T) = rank(T”) = rank(T) 


by the next argument. 


For arbitrary matrix A, denote A to be the matrix consisting of 
the conjugate of entris of A. Thus we have A* = At. We want to 
claim that rank(A) = rank(A*) first. Since we already have that 
rank(A) = rank(A‘), it’s sufficient to show that rank(A) = rank(A). 
By Theorem 3.6 and its Corollaries, we may just prove that {v;}ier 
is independent if and only if {Gj};e7 is independent, where 0; means 
the vector obtained from v; by taking conjugate to each coordinate. 
And it comes from the fact 


Yo aiv; = 0 


wel 
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if and only if 

orn = = AAU; = 0. 

tel tel 
Finally, by Theorem 6.10 we already know that [T]% = [T*], for 
some basis 3. This means that rank(T) = rank(T*). And so 


rank(TT” ) = rank(T”**T") = rank(T™) = rank(T). 
(c) It comes from the fact L4+ = (La)*. 


14. It’s linear since 
T (cv, + 22) = (cr, + 2, y)z 


= c(@1,y)z + (22, y)z = cT (a1) + T(22). 
On the other hand, we have 
(u,T"(v)) = ((u,y)z, v) 
= (u,y){z,v) = (uw, (v, 2)y) 
for all u and v. So we have 
T" (x) = (x, z)y. 
15. (a) Let ye W be given. We may define 


Gy (2) = (T(x), y)2, 


which is linear since T is linear and the first component of inner 
product function is also linear. By Theorem 6.8 we may find an 
unique vector, called T*(y), such that 


(x, T"(y))1 = (T(2),y)2 
for all x. This means T*(y) is always well-defined. It’s unique since 
(x,T"(y))1 = (2,U(y))1 


for all x and y implies that T* =U. 
Finally, it’s also linear since 


(x, T*(y+cz))1 =(T(2), y+ cz)o 
=(P(a),y}2+ AT (2), 2)2 
= (2,T"(y))1 +(x, T"(z))1 
= (x,T"(y))1 + (2, cI" (z))1 = (2, T"(y) + eT" (Zz) (1 


for all x, y, and z. 
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(b) Let 
and 


Further, assume that 
n 
T(v;) -_ > aij us: 
i=1 


This means that [T]} = {ai;}- 
On the other hand, assume that 


T*(u;) = Yeast 
And this means 
Cig = (Vis T” (Uy ))1 = (T (vi), Ug )2 = aya 
and (T*}2 = ((TY3)* 
(c) It comes from the same reason as Exercise 6.3.13(b). 
(d) See 
(7 (x),y) = (y,T*(a)) 
=(T(y), a) = (2, T"(y)). 
(e) If T(x) = 0 we have T*T(x) = T*(0) =0. If T*T(x) = 0 we have 
0=(x,T"T()) = (T(x), T(2)) 
and henve T(x) = 0. 


16. (a) Compute 
(x,(T+U)"(y))1 = (2+ U)(2), y)2 


= (T(x) + U(x), y)2 = (T(®), y)2 + (U(x), y)2 
= (x, T"(y))1 + (@,U"(y))a = (a, (2 + U")(y))1 
for all x and y. 


(b) Compute 
(x, (cT’)*(y))1 = (cT'(2), y)2 = 


(T(x), y)2 = (x, T" (ey))1 = (2, eT" (y))1 


for all x and y. 
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17. 


18. 


(c) Let T isa mapping on W and U is a mapping from V to W. Compute 
(x,(TU)*(y))1 =(TU(z),y)2 


= (U(x), T"(y))2 = (2, U°T"(y))1 
for all x and y. 
Let T is a mapping from V to W and U is a mapping on V. Compute 
(x, (TU)"(y))1 = (TU(2), y)2 


=(U(@),T*(y)hi = (2, UT" (y))1 
for all x and y. 


(d) Compute 
(x,T™"(y))2 = (T"(x),y)1 
= (x,T(y)) 


for all x and y. 


If ce R(T*)* we have 


O= (2, T"(y))1 = (T (2), y)2 
for all y. This means that T(2) = 0 and so x € N(T). Conversely, if 
x € N(T), we have 
(x, T"(y))1 =(T(2),y)2 = 0 
for all y. This means that x is an element in R(T*)*. 
For arbitrary matrix M we already have det(M) = det(M‘). So it’s suf- 


ficient to show that det(1/) = det(W). We prove this by induction on n, 
the size of a matrix. For n = 1, we have 


det (a) = det (a) . 
For n = 2, we have 
a b\ ——_ 
dee (2 ') = ad-— be 
oe ab 
= ade = det (2 i). 


Suppose the hypothesis is true for n = k-1. Consider ak x k matrix M. 
We have 


det(M) = 3 (1) Mi; - det(Mi;) 


j=l 


= y (-1)"9 Mi; - det(Mi;) 
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20. 


21. 


22. 


23. 


= > (-1)'*7 Mi; det(M;;) i det(M). 


This means that 
det(A) = det(A‘) = det(A%*). 


Let v; be the i-th column of A. Then we have v; is the i-th row of A%*. 
And the desired result comes from the fact 


(A*A)ig = 05 05 = (Uz, Vi), 
which is zero when 7 # j. 
Follow the method after Theorem 6.12. 


a The linear function is -2¢+ 3 with error E = 1. The quadratic function 
s 5t? — $t+2 with the error E =0. 


ae The linear function is 3t + 35 5 with error E = ae The quadratic 
function is 7t? + 3 + aay with the error E= &. 


c) The linear pend is - ot + 4 with error oe =. The quadratic 
5 5 


function is -70? - 2t+ 8 8 with the error FE = 4 


Follow the same method. We have the linear function is 2.1a — a So 
the spring constant is 2.1. 


As the statement in Theorem 6.13, we may first find a vector u such that 
AA*u=b. Finally the minimal solution would be A*u. 


(a) The minimal solution is (2,4,-2). 

(b) The minimal solution is (2,3, 1). 

(c) The minimal solution is (1,, -3, 5). 
(d) The minimal solution is 4(7,1,3,-1). 
(a) Direct calculate that 


and get the result by 


A*A (‘) = Aty. 


For the second method, we may calculate that 
m 9 
E= >> (yi - ci - d) 
i=1 
and aE = 0 and ge = 0 give the normal equations. 
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(b) We want to claim that y = ct+d, whrer t, 7 are defined in the question 
and c, d are a solution of the normal equations. But this is an instant 
result by dividing the second equation by m. 


24. (a) Check 
T(co +7)(k) = > (co +7)(k) 


: ed (@)() + YH) 


=cT(o)(k) +T(7)(k). 
(b) For k <n we have 
T(€n)(k) = Yen(i =1s Seb): 


i=1 


And for k > n we have 
T(en)(K) = Jo en(i) = 0 Yo ex(b. 
(c) Suppoe that T* exist. We try to compute T*(e1) by 
1 T(E Mi) = (en, T*(e1)) = (Dever) = 1 


This means that T*(e,)(é) = 1 for all i. This is impossible since 
T*(e1) is not an element in V. 


6.4 Normal and Self-Adjoint Operators 


1. (a) Yes. Check TT* =T? =T*T. 


. 1 1 1 0 1 0 
(b) No. The two matrices (; ) and ( ) have (i) and (") to be 
their unique normalized eigenvectors respectly. 
(c) No. Consider T(a,b) = (2a,b) to be a mapping from R* to R? and 
B to be the basis {(1,1),(1,0)}. We have T is normal with T* = T. 


1 
But [T], = ( is not normal. Furthermore, the converse is also 


0 
ab 32 
not true. We may let T(a,b),= (b,b) be a mapping from R? to R? 
and £ be the basis {(1,-1),(0,1)}. In this time T is not normal with 


T*(a,b) = (0,a+ 6). However, [T],g = ; : is a normal matrix. 


(d) Yes. This comes from Theorem 6.10. 
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(e) Yes. See the Lemma before Theorem 6.17. 


(f) Yes. We have I* = I and O* = O, where J and O are the identity and 
zero operators. 


(g) No. The mapping T(a,b) = (—b,a) is normal since T*(a, b) = (a,—0). 
But it’s not diagonalizable since the characteristic polynomial of T’ 
does not split. 


(h) Yes. If it’s an operator on a real inner product space, use Theo- 
rem 6.17. If it’s an operator on a complex inner product space, use 
Theorem 6.16. 


2. Use one orthonormal basis 6 to check [T]g is normal, self-adjoint, or 
neither. Ususally we'll take 8 to be the standard basis. To find an or- 
thonormal basis of eigenvectors of T’ for V, just find an orthonormal basis 
for each eigenspace and take the union of them as the desired basis. 


(a) Pick 8 to be the standard basis and get that 


2 -2 
me~(% 3). 
So it’s self-adjoint. And the basis is 


alg coy 
ie es 


(b) Pick 6 to be the standard basis and get that 


ce a 
o-(1 5 i. 
A: 20: 


So it’s neither normal nor self-adjoint. 
(c) Pick 6 to be the standard basis and get that 


[T']e = iE Hi 


So it’s normal but not self-adjoint. And the basis is 


1 1 1. te Des Me 
(Gas 


(d) Pick an orthonormal basis 6 = {1, /3(2t - 1), /6(6t? - 6t + 1)} by 
Exercise 6.2.2(c) and get that 


0 2/3 0 
rny-( 0 wa) 


(1,-2), (2,1)}. 


0 0 0 


So it’s neither normal nor self-adjoint. 
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(e) Pick 6 to be the standard basis and get that 


0 
0 
1 
0 


a) 


0 
1 
0 
0 
So it’s self-adjoint. And the basis is 


1 1 
{(1,0,0,0), ae 1,1,0), (0, 0,0, 1), Wei 1,-1,0)} 


(f) Pick 6 to be the standard basis and get that 


001 0 
The=|1 0 0 of: 
010 0 
So it’s self-adjoint. And the basis is 
(2 (01,0), =5-(0 10,1); 00,150) =O} 
V2 V2 V2 V2 


3. Just see Exercise l(c). 


4. Use the fact 
(TU)* =U*T* =UT. 


5. Observe that (T - cl)* = T* - GI and check 
(Pcl)(T +e1)* = (0 =e Sel 2 TP ot ere cP 


and 


(T -cl)*(T -cl) = (T* -t1)(T - cl) = T*T - a - cP + |c? 1 


They are the same because TT7* = 7" T. 


6. (a) Observe the fact 
1 1 
Ty = (S(P+T"))" = a (F +T)=T, 
and 
1 
24 
(b) Observe that T* = Uf -iU3 =U, -iU2. This means that 


Ty = (5(P-T"))" =-5(0* - 7) = Te 


1 
U=5(7+T")=T 


and ; 
aera eae aes 
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(c) Calculate that 
= is 1 2 * Eze a *\2 ad 2 * mem _ *\2 
TiTy-TeTy = 5 (1?-TT*+T*T-(I*)?)- 2 (1? +I T*-T*T-(T"*)’) 
a i} 


1 
=—(T*T-TT"). 
24 
It equals to To if and only if T is normal. 


7. (a) We check 
(x, (Tw )"(y)) = (Tw (2), 9) = (1(@), 9) 
=(T"(x),y) = («,T(y)) = (x, Tw (y)) 
for all x and y in W. 
(b) Let y be an element in W*. We check 


(x, I" (y)) = (T(x), y) = 0 


for all z € W, since T(x) is also an element in W by the fact that W 
is T-invariant. 


(c) We check 
(x, (Tw)"(y)) = (Zw (a), y) = (12), 9) 


(x,T"(y)) = (2, (T")w(y))- 


(d) Since T is normal, we have TT* = T*T. Also, since W is both T- 
and T*-invariant, we have 


(Tw)" = (T")w 
by the previous argument. This means that 


Tw (Iw)* = Tw(I")w = (1 )wTw = (Tw) "Tw. 


8. By Theorem 6.16 we know that T is diagonalizable. Also, by Exercise 
5.4.24 we know that Ty is also diagonalizable. This means that there’s a 
basis for W consisting of eigenvectors of T. If x is a eigenvectors of T, then 
x is also a eigenvector of T* since T is normal. This means that there’s a 
basis for W consisting of eigenvectors of T*. So W is also T-invariant. 


9. By Theorem 6.15(a) we know that T(«) = 0 if and only if T*(a) = 0. So 
we get that N(T) = N(T*). Also, by Exercise 6.3.12 we know that 


R(T) = N(T*)* = N(T)* = R(T"). 
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10. Directly calculate that 
T(x) + ial? = (T(x) + ix, T(x) + ix) = 
= |T(2) |? + (T(a), ix) # (ix, T(2)) + lla? 
= |T(x)|? ¥ (T(x), 2) (T*(x), 2) + |x|? = |T(z)|? + Il’. 


Also, T + 7I is injective since |T(x) + x|| = 0 if and only if T(x) = 0 and 
x =0. Now T-iI is invertible by the fact that V is finite-dimensional. 
Finally we may calculate that 


(x, (PT - al)" ]*(L + i1)(y)) = (LiL) (2), (L + 1) (y)) 


= ((T-aI)"(x), (L* + i2)(y)) = (LT - i)" (x), (T -i1)*(y)) 
= ((T-i1)(T - i)" (x), y) = (x,y) 
for all x and y. So we get the desired equality. 


11. (a) We prove it by showing the value is equal to its own conjugate. That 
is, 


(T(x), a) = (x, T(«)) 
(x, T"(x)) = (T(a), 2). 
(b) As Hint, we compute 


0=(T(a+y),x+y) 


=(T(x),x) + (T(x), y) + (Ty), 2) + (Ty), 9) 
=(T(x),y) + (T(y), 2). 
That is, we have 
(T(a),y) = (T(y), 2). 
Also, replace y by iy and get 
(T(x), ty) = -(T (iy), 2) 


and hence 
-i(T (x), y) = -i(T(y), 2). 
This can only happen when 


(7(a),y) = 0 


for all x and y. So T is the zero mapping. 
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12. 


13. 


(c) If (T(x), x) is real, we have 
(T(x), a) = (a, T(2)) = (1" (x), 2). 


This means that 

((T-T")(x),«) = 0 
for all «. By the previous argument we get the desired conclusion 
T=T". 


Since the characteristic polynomial splits, we may apply Schur’s Theorem 
and get an orthonormal basis ( such that [T],g is upper triangular. Denote 
the basis by 


GB = {v1,V2,..-,Un}- 
We already know that v; is an eigenvector. Pick ¢ to be the maximum 
integer such that v,,v2,...,vz are all eigenvectors with respect to eigen- 


values \;. If t =n then we’ve done. If not, we will find some contradiction. 
We say that [T]g = {Ai,;}. Thus we know that 
t+1 
T(ve41) = >) Aas Ui. 


1=1 


Since the basis is orthonormal, we know that 
Ajt+1 = (T'(vt41), Va) = (ve41, 2" (v4 )) 


= (vi41, \vi) = 0 


by Theorem 6.15(c). This means that v,41 is also an eigenvector. This is 
a contradiction. So 8 is an orthonormal basis. By Theorem 6.17 we know 
that T’ is self-adjoint. 


If A is Gramian, we have A is symmetric since A’ = (B'B)' = B'B = A. 
Also, let \ be an eigenvalue with unit eigenvector x. Then we have Az = Ax 


and 
d= (Aa, x) = (BY Bz, x) = (Bx, Ba) > 0. 


Conversely, if A is symmetric, we know that Ly is a self-adjoint operator. 
So we may find an orthonormal basis 8 such that [LZ], is diagonal with 
the iz-entry to be Ai. Denote D to be a diagonal matrix with its i7-entry 
to be Aj. So we have D? = [La]g and 


A=[1]g[La]ol1e = (U¢D)(DUI0), 


where a is the standard basis. Since the basis 6 is orthonormal, we have 
[1] = ([1]2)*. So we find a matrix 


B = Dit} 


such that A = BB. 
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14. 


15. 


16. 


We use induction on the dimension n of V. If n = 1, U and T will be 
diagonalized simultaneously by any orthonormal basis. Suppose the state- 
ment is true for n < k-1. Consider the case n = k. Now pick one arbitrary 
eigenspace W = E) of T for some eigenvalue A. Note that W is T-invariant 
naturally and U-invariant since 


TU(w) =UT(w) = U(w) 


for allwe W. If W = V, then we may apply Theorem 6.17 to the operator 
U and get an orthonormal basis ( consisting of eigenvectors of U. Those 
vectors will also be eigenvectors of T’. If W is a proper subspace of V, we 
may apply the induction hypothesis to Tw and Uw, which are self-adjoint 
by Exercise 6.4.7, and get an orthonormal basis 6, for W consisting of 
eigenvectors of Tw and Uw. So those vectors are also eigenvectors of T 
and U. On the other hand, we know that W* is also T- and U-invariant 
by Exercise 6.4.7. Again, by applying the induction hypothesis we get an 
orthonormal basis 62 for W* consisting of eigenvectors of T and U. Since 
V is finite dimentional, we know that 6 = 6; U 62 is an orthonormal basis 
for V consisting of eigenvectors of T and U. 


Let T = La and U = Lg. Applying the previous exercise, we find some 
orthonormal basis 6 such that [T]g and [U],g are diagonal. Denote a to 
be the standard basis. Now we have that 


[T]s = W2AL]3 
and 
[U]s= (2 BL13 
are diagonal. Pick P =[J]% and get the desired result. 


By Schur’s Theorem A = P~! BP for some upper triangular matrix B and 
invertible matrix P. Now we want to say that f(B) =O first. Since the 
characteristic polynomial of A and B are the same, we have the charac- 
teristic polynomial of A would be 


f(t) = TT(B. -1) 


since B is upper triangular. Let C = f(B) and {e;} the be the standard 
basis. We have Ce; = 0 since (Bi,J-B)e, = 0. Also, we have Ce; = 0 since 
(B,,I - B)e; is a linear combination of e),e2,...,e;-1 and so this vector 
will vanish after multiplying the matrix 


B 


(By I - B). 


jal 


So we get that f(B) =C =O. Finally, we have 


f(A) = f(P"'BP) = P™!f(B)P =O. 
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17. 


(a) 


— 
ion 
wm 


By Theorem 6.16 and Theorem 6.17 we get an orthonormal basis 
@ = {U1,V2,---,Un}, 


where v; is the eigenvector with respect to the eigenvalue \;, since T 
is self-adjoint. For each vector x, we may write it as 


n 
w= x A,U;j.- 
t=1 


Compute 


(T(x),x =(y ajAjVi, » a;V;) 


_ 2 
— » la,| rG- 
i=1 


The value is greater than [no less than] zero for arbitrary set of a;’s 
if and only if A; is greater than [no less than] zero for all i. 
Denote 6 to be 

{e1, €Q,+++5 Cn}. 


For each x € V, we may write it as 


Also compute 


(T(e).2) = (0 (9) Ay wasder Ye) 


t=1 j=l 


n n 
= yee Aja; )G = >) Aigajt. 
i=1 j=l ig 
Since T' is self-adjoint, by Theorem 6.16 and 6.17 we have A = P* DP 
for some matrix P and some diagonal matrix D. Now if T is positive 
semidefinite, we have all eigenvalue of T’ are nonnegative. So the iz- 
entry of D is nonnegative by the previous argument. We may define a 
new diagonal matrix EF whose ii-entry is /Dj;. Thus we have E? = D 
and A=(P*E)(EP). Pick B to be EP and get the partial result. 
Conversely, we may use the result of the previous exercise. If y = 
(a1, d2,...,@,) is a vector in F”, then we have 


y’ Ay = > AjjQjy 
tj 


and 
y* Ay = y* B* By = (By)* By = |By|? > 0. 
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18. 


(d) Since T is self-adjoint, there’s a basis 8 consisting of eigenvectors of 


Ne 


——, 


ae 


wa 


T. For all x € 6, we have 
UU? (Gy Fay aia. 
If \=0, then we have U?(x) = 0 and so U(x) =0= T(z) since 
(U(x), U(a)) = (U*U(a), x) = (U? (a), 2) = 0. 


By the previous arguments we may assume that ’ > 0. And this 
means that 


0 = (U? -71)(z) = (U + ATU - d1)(2). 


But det(U + AJ) cannot be zero otherwise the negative value —A is an 
eigenvalue of U. So we have U + XI is invertible and (U - AI)(ax) = 0. 
Hence we get U(x) = Ax = T(x). Finally since U and T meet on the 
basis 8, we have U =T. 


We have T and U are diagonalizable since they are self-adjoint. Also, 
by the fact TU = UT and Exercise 5.4.25, we may find a basis 8 
consisting of eigenvectors of U and T. Say x € 8 is an eigenvector of 
T and U with respect to 4 and py, who are nonnegative since T’ and 
U are postive definite. Finally we get that all eigenvalue of TU is 
nonnegative since TU(x) = Aux. So TU = UT is also positive definite 
since they are self-adjoint by Exercise 6.4.4. 


Follow the notation of Exercise 6.4.17(b) and denote y = (a1, @2,...,@n). 
We have 


n 


Me 


(T(S aie), 


ase) = (>) (>) Aizas)ex, > ases) 
1 i=l i=l 


A, 


j 
2 Ajjajaj = y" Ay = (La(y),y)- 
ij 

So the statement is true. 


We have T*T and T'T™ are self-adjoint. If \ is an eigenvalue with 
the eigenvector x, then we have T*T'(x) = Ax. Hence 


d= (T*T (x), 2) = (T(x), T(x)) > 0. 


We get that T*T is positive semidefinite by Exercise 6.4.17(a). By 
similar way we get the same result for TT”. 


We prove that N(T*T) = N(T). If xe N(T*T), we have 
(I"T (2), #) = (T(x), T(2)) = 0 


and so T(x) =0. If a ¢ N(T), we have T*T(x) = T*(0) =0. 
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Now we get that null(T*T) = null(T) and null(TT™*) = null(Z™) since 
T** =T*. Also, we have rank(T) = rank(T™*) by the fact 


rank([T']g) = rank([T]3) = rank([T™ ]) 


for some orthonormal basis 8. Finally by Dimension Theorem we get 
the result 


rank(T*T) = rank(T) = rank(T™”) = rank(TT™). 


19. (a) It comes from that 
(([7+U)(«),x) =(T(x),x) + (U(x), 2) >0 
and (T+U)* =T*+U* =T+U. 
(b) It comes from that 
((cT)(x), x) =c(T (a), x) >0 
and (cT)* =¢T* =cT. 
(c) It comes from that 
(T7"(x),2) = (y, T(y)) > 0, 
where y= T~'(zx). Note that 
(PO) F* SFT) =F, 
So we have (T-')* = (T*) 1+ =T"l. 
20. Check the condition one by one. 


(x + 2,y)' = (T(a+2),y) 
= (T(x),y) + (T(z), y) = (ay) + (2,9). 


(cx, y) = (T(cx), y) 
= (T(x), y) = (a, y)’. 


(x,y) = (T(a),y) 
= (y, T(x)) = (T(y), 2) = (y, 2)’. 


(x, x)’ = (T(x), x) >0 


if x is not zero. 
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21. 


22. 


As Hint, we check whether UT is self-adjoint with respect to the inner 
product (z,y)’ or not. Denote F to be the operator UT with respect to 
the new inner product. Compute that 


(x, F*(y))! = (UT (x), y)' = (TUT (a), y) 


=(T(x),UT(y)) = (2, F(y)y’ 

for all and y. This means that UT is self-adjoint with respect to the 
new inner product. And so there’s some orthonormal basis consisting of 
eigenvectors of UT and all the eigenvalue is real by the Lemma before 
Theorem 6.17. And these two properties is independent of the choice of 
the inner product. On the other hand, T~! is positive definite by Exercie 
6.4.19(c). So the function (x,y) := (T~!(a), y) is also a inner product by 
the previous exercise. Denote F” to be the operator TU with respect to 
this new inner product. Similarly, we have 


(x, F™(y))" = (TU (x), y)" = (U(2), 9) 
= (T(x), TU(y)) = (x, Fy)" 
for all x and y. By the same argument we get the conclusion. 


(a) For brevity, denote V; and V2 to be the spaces with inner products 
(-,-) and (-,-)’ respectly. Define f, (a) = (x,y)! be a function from Vi 
to F. We have that f,(a) is linear for ¢ on V;. By Theorem 6.8 
we have f,(x) = (T(x), y) for some unique vector T(x). To see T is 
linear, we may check that 
(T(x +z),y) = (+ z,y) = (v,y) + (zy) 
=(T(x),y) + (T(z), y) = (T(x) + T(z), y) 
and 
(T'(cx),y) = (cx, y) = cla, y) 
AT (a), y) = (cT'(a),y) 
for all x, y, and z. 
(b) First, the operator T is self-adjoint since 


(x, T*(y)) = (T(x), y) = (x,y) 


= (y, x) = (T(y), x) = (x, T(y)) 
for all x and y. Then T is positive definite on V; since 


(T(x), x) = (x,x)'>0 


if x is not zero. Now we know that 0 cannot be an eigenvalue of T. 
So T is invertible. Thus T~! is the unique operator such that 


(x,y) =(T7*(a), 9. 


By the same argument, we get that T~! is positive definite on V2. So 
T is also positive definite by Exercise 6.4.19(c). 
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23. 


24. 


As Hint, we denote V; and V2 are the spaces with inner products (-,-) 
and (:,-)’.. By the definition of (-,-)’, the basis 6 is orthonormal in V3. 
So U is self-adjoint on V2 since it has an orthonormal basis consisting of 
eigenvectors. Also, we get a special positive definite, and so self-adjoint, 
operator T, by Exercise 6.4.22 such that 


(x,y) = (T(a),y). 
We check that U = Ty!U*T, by 
(x, T'U*Ty(y)) = (TUT; *(2),y) 
= (UT; *(x),y)' = (Ty*(2),U(y)y = (x, U(y)) 
for all x and y. So we have U = Poh and so 7;U = U*T,. Pick 
T. = T;'U* and observe that it’s self-adjoint. Pick Tj = Ty’ to be a 


positive definite operator by Exercise 6.4.19(c). Pick T; = U*T; to be a 
self-adjoint operator. Now we have U = TT, = TiT3. 


(a) Let 
GB = {v1,02,..-,Un} 
and 
y = {W1, W2,---,Wn} 
be the two described basis. Denote A to be [T]3. We have 
Ant 


T(w1) = T( i > T(v1) = An T(1). 


I 

Let t be the maximum integer such that T(w;) is an element in 
span{wy1, W2,..., Wr}. 

If t = dim(V), then we’ve done. If not, we have that 


t 


1 
Wt+1 = —(u% = Y (Vir1, Wz )W;), 
L j=l 


where 
t 


L= |v; - ee (Vt41, Wy) W; [. 


jal 
By the definition of w;’s we may define 


W; =span{v1,v2,...,Un} =span{wy, W2,..., We}. 


Now we have T(w;) € W; since 
1 t-1 
Twe) = (Pv) - (ve, wy) T(wy)) 
j=l 


and T(v,) « W; and T(w;) « W; c W; for all 7 < t. This is a contra- 
diction to our choice of i. So [T'], is an upper triangular matrix. 
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(b) Ifthe characteristic polynomial of T splits, we have an ordered basis 6 
such that [T], is upper triangular. Applying the previous argument, 
we get an orthonormal basis y such that [T]., is upper triangular. 


6.5 Unitary and Orthogonal Operators and Their 
Matrices 


1. (a) Yes. See Theorem 6.18. 
(b) No. Each rotation operator with nonzero angle is a counterexample. 


(c) No. A matrix is invertible if it’s unitary. But an invertible matrix, 


i; | for example, may not be unitary. 


(d) Yes. It comes from the definition of unitarily equivalence. 


(e) No. For example, the idenetity matrix J is an unitary matrix but the 
sum J+J is not unitary. 


(f) Yes. It’s because that T is unitary if and only if TT* =7T*T = J. 


—— 
ithe} 
Wa 


No. The basis @ should be an orthonormal basis. For example, we 
have T(a,b) = (b,a) is an orthogonal operator. But when we pick 8 


to be 
{(1,1),(1,0)} 
1 


we get that [T], = (; | is not orthogonal. 


(h) No. Consider the matrix (A i) Its eigenvalues are 1. But it’s not 
orthogonal. 
(i) No. See Theorem 6.18. 
2. Just follow the process of diagonalization. But remember that if the di- 


mension of some eigenspace is more than 1, we should choose an orthonor- 
mal basis on it. 


(a) 


(b) 
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1 1 1 

Va va Ve 4 0 0 
P=|—5 0 -¥4|,D=]|0 -2 0 

Me say Ae Of, 8 

Ve. “32 6 


1 1 1 
Ve Va 4 0 0 
P=|4 o -¥],D=]0 1 Of. 
e 001 
[ 


3. If T and U are unitary [orthogonal] operators, then we have 


[ZU (2)Il = |) = lel. 


4. Pick the standard basis 3 and compute the matrix representation [T; |g = 
(z). This means that T7 = Tz. So it always would be normal. However, 
it would be self-adjoint only when z is real. And it would be unitary only 
when |z| = 1. 


5. For these problem, try to diagonalize the matrix which is not diagonalized 
yes. And check whether it can be diagonalized by an orthonormal basis. 


(a) No. They have different eigenvalues. 

(b) No. Their determinant is different. 

(c) No. They have different eigenvalues. 

(d) Yes. We have 
0% w\)fo 10°00 a wz) foo 
0m -z (an allo SL). (0 : 7 
1 O 0 0 O 1/\1 oO 0 0 O -2 


(e) No. One is symmetric but the other is not. 


6. If T is unitary, we must have 
1 1 
0= ICAI - LF? =f inPlsPae— ff Lae 


= f° a-(wPyipPae 


for all f ¢ V. Pick f = (1-|hAl?)2 and get 1-|hl? = 0 and so |h] = 1. 
Conversely, if |h| = 1, we have 


1 1 
ITP-WP =f aPireae— [pat 


= [° @-InPyipPat =o 


and so T is unitary. 
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7. 


10. 


By the Corollary 2 after Theorem 6.18, we may find an orthonormal basis 
8 such that 


0 : 
Pipe 2 9 
Qi: des, Gio We 


Also, since the eigenvalue »; has its absolute value 1, we may find some 
number jp; such that yi? = A; and |y;| = 1. Denote 


Hy OS 0 
te 0 L2 : 
ae : - 0 
Or Hh OL, is 


to be an unitary operator. Now pick U to be the matrix whose matrix 
representation with respect to G is D. Thus U is unitary and U? = T. 


. Exercise 6.4.10 says that ((T’-iI)~!)* =T'+iI. So check that 


[(P +i)(T -il)")* (8 +i1)(T-it) 
= ((T-a1)1)*(T + 40)* (1 + i1)(T - a1)" 
=(T+i1I) (TT -iD(T+id(T-ity! 
=(T+il) (1 +i) -il)(T-it) 1 = 1. 


Use Exercise 2.4.10 we get that the operator is unitary. 


. The operator U may not be unitary. For example, let U(a,b) = (a+ 6,0) 


be an operator on C?. Pick the basis {(1,0),(0,1)} and we may observe 
that 
UG, Of = U0, DI = 1 = 14,0) = 10, 1]. 


But it is not unitary since 

|UD) =14 10, )] = v2. 
Exercise 2.5.10 says that tr(A) = tr(B) is A is similar to B. And we know 
that A may be diagonalized as P* AP = D by Theorem 6.19 and Theorem 


6.20. Here D is a diagonal matrix whose diagonal entries consist of all 
eigenvalues. This means 


tr(A) = tr(D) = 3 ri 


and 
tr(A* A) = tr((PDP*)*(PDP")) 


S( PD DP yaw Dy =) yl: 
t=1 
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11. 


12. 


13. 


14. 


Extend {(¥, z, 2)} to be a basis and do the Gram-Schmidt process to get 
an orthonormal basis. The extended basis could be 


122 
—,-,-—),(0,1,0),(0,0,1 
{(3> 3’ 3) , | ), ( >) 7 )} 
and the othonormal basis would be 
122 2 5 2? 2 1 
{( 9 7 ),( ’ 9 ),( ,0, ee 
3/373 3V5 3/5 35 V5 A/S 
So the matrix could be 
1 2 2 
3 3 ae 
2 5 2 
3Y5 35 3/5 
SNe 
V5 V5 


By Theorem 6.19 and Theorem 6.20 we know that A may be diagonalized 
as P* AP = D. Here D is a diagonal matrix whose diagonal entries consist 
of all eigenvalues. Now we have 


det(A) = det(PDP”*) = det(D) = Il Vi- 


i=1 


The necessity is false. For example, the two matrices (i a and 


(0 e)-( 9) Ge) )) 


are similar. But they are not unitary since one is symmetric but the other 
is not. 


We may write A = P* BP. Compute 
(La(x),2) = (Lp+gp(2),2) 


= (LpLplp(2),2) = (Lp(Lp(2)), Lp(2)). 


If A is positive definite, for each vector y we may find some 2 such that 
Lp(a) = y since P is invertible. Also, we have 


(La(y),y) = (La(Lp(2)), Lp(2)) = (La(2), x) > 0. 
If B is positive definite, we may check that 


(La(x),#)(Lp(Lp(«)), Lp(x)) > 0. 
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15. 


16. 


17. 


18. 


(a) We have 
[Uw (@)| = 10 (2) = lel 
and so Uw is an unitary operator on W. Also, the equality above 


implies that Uy is injection. Since W is finite-dimensional, we get 
that Uw is surjective and U(W) = W. 


(b) For each elment w ¢ W we have U(y) = w for some y ¢ W by the 
previous argument. Now let x be an element in W+. We have U(a) = 
WwW, + We for some w, € W and we € W! by Exercise 6.2.6. Since U is 
unitary, we have some equalities 


lull? = lel’, 


|x|? = or + wel? = orl]? + Jewell? 


by Exercise 6.1.10. However, we also have that U(x + y) = 2w, + we. 
So we have that 
O= |x + yll? - |2w1 + well? 


= all? + [yl? — Aller]? - wall? = -2| 201 |”. 
This means that w, = 0 and so U(x) = woe W?. 


This example show the finiteness in the previous exercise is important. 
Let V be the space of sequence defined in Exercise 6.2.23. Also use the 
notation e; in the same exercise. Now we define a unitary operator U by 


U (e241) = €24-1 if @>  ; 
U(e1) =€2 3 
U(e:) = U (e242) if i>0O 


It can be check that |U(«)|| = ||| and U is surjective. So U is an unitary 
operator. We denote W to be the subspace 


span{eg, €4,€6;-- Fi 


and so we have 
ab 
W- = {e1, €3,€5, a aie 


Now, W is a U-invariant subspace by definition. However, we have eg ¢ 
U(W) and W? is not U-invariant since U(e1) = e2 ¢ W+. 


Let A be an unitary and upper triangular matrix. For arbitrary indices 7 
and j such that i > 7. We have A;; = 0 since A is upper triangular. But 
we also have that Aj; = Ajj = 0. So A is a diagonal matrix. 


Write A ~ B to say that A is unitarily equivalent to B. Check the three 
conditions in the Appendix A. 


reflexivity Since A= J* AI, we get A~ A. 
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symmetry If A~ B, we have B = P* AP and so A= PBP* =(P*)*BP*. 
This means that B ~ A. 


transitivity If A ~ B and B ~ C, we have B = P* AP and C = Q* BQ. 
This means that 
C= Q" BQ = Q*P*APQ = (PQ) A(PQ) 
and so A~C. 
19. By Exercise 6.1.10 we have 


|U (or + v2)? = lor - v2 |? 
= lui? + all? = lle + vall?. 


20. (a) If it’s a complex inner product space, we have 


4 
(U(x), U(y)) = Sy U(x) + Uy)? 


4 4 
dy PU (e+ iy)? = oe + iyl?? = (x,y). 


If it’s a real inner product space, we may also use the above equality 
but take the summation only over k = 2,4. 


(b) Since U(a+ y) = U(x) ifxe W an ye W"+, we know that R(U) = 
U(W). We get the desired result by applying the previous argument. 


(c) Just extend the set 
{v1,V2,...,UK} 


to be an orthonormal basis 
= {v1,V2,---,Un} 


for V, where n = dim(V). First we know that U(v,;) = 0 if j > k. 
So the j-th column of [U]., is zero. On the other hand, if we write 
A=([U]., we have 


U(v;) = > U4 505. 
i=1 


So the first k columns is orthonormal since 


0 = (U(vs),U(%:)) = (S Diatis Wie) 


UisUie = (Aet)* Aes 


Ms 


t=1 
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and ms 
1=(U(vs),U(vs)) = pe UisVi, 2, Vises) 


eu le yt te. 


Ms 


w=1 


(d) Since V is finite-dimensional inner product space, we have R(U)+ ® 
R(U)=V. And so by Exercise 6.5.20(b) we get the desired result. 


(e) First, T is well-defined since the set 6 defined in the previous question 
is a basis. To show that T = U*, it’s sufficient to check that 


(U(x), y) = (x,T(y)) 


for all x and y in § by Exercise 6.1.9. We partition @ into two parts 
X and Y, who consist of all U(v;)’s and w,’s. 


e Ifa=U(u;),y = U(v;) ¢ X, we have 
(U (vi), T(U(v5))) = (U (wi), v7) = (U* (vi), U (5) 


by Exercise 6.5.20(a). 
e Ife=U(u,) ¢X and y=w,; € Y, we have 


(U(v;), T(w;)) “Z (U(v;), 9) 
=0= (U?(v;), w;). 
e Ifx=w;¢ X and y= U(v;) € Y, we have 
(wi, T(U(v;))) = (wi, 07) = U (wi), U(0;)). 
e Ift=wi,y=w,; €Y, we have 
(w;,T(w;)) = (w;, 0) =OQ= (U(w;), w;). 


(f) Take the subspace W’ to be R(U). Thus we have T((W’)*) = {0} 
by the definition of T. Also, we may write an element x in R(U) to 
be 


k 
v= > aU(%). 
i=l 


Since the set of U(v;)’s is orthonormal, we have 
2) 2 : 2 2 
ZIP = Do las? =| DP aval” = IT (21. 
i=l i=l 


21. Since A is unitarily equivalent to B, we write B = P* AP for some unitary 
matrix P. 
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22. 


23. 


(a) Compute 
tr(B* B) = tr((P* AP)*(P* AP)) = tr(P* A* AP) = tr(A* A). 


(b) We compute the trace of A*A and get 


tr(A*A) = 3° (A* AY 


i=1 


=D (A* ie Ani = D)Aagl?. 
UJ 


i=l k=1 


1= 


B 


Use the result in the previous argument we get the conclusion. 


(c) By the previous argument, they are not unitarily equivalent since 
|1|? + [2|? + |2|? + |i]? = 10 


is not equal to 
lil? + |4|? +|1/? + |1|? = 19. 


(a) Let f(a) = 2+ be a translation. We may check it’s a rigid motion 
by 
If(2)- FMI = I@+t)- (+O) = le-yl- 


(b) Let f,g be two rigid motion, we have 


Ifo(x) — fol = Ig() - 9@@)I = la - yl 


So the composition of f and g is again a rigid motion. 


We define T to be T(x) = f(x) — f(0). By the proof of Theorem 6.22, we 
know that T is an unitary operator. Also, by Theorem 6.22 we know f 
is surjective since it’s composition of two invertible functions. Hence we 
may find some element ¢ such that f(t) = 2f(0). Now let g(x) = a+t. 
Since T is linear, we have 


Tog(a) =T(x+t) =T(x)+T(t) 


= f(x) - f(0) + F@) - F(0) = F(a). 
Finally, if f(7) = T(a+t) = U(x+ vo) for some unitary operator U and 
some element vp. We’ll have 


T(-vo + t) = U(-v0 ee v0) = 0. 


Since T is unitary and hence injective, we know that t = vp. And thus U 
must equal to JT. So this composition is unique. 
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24. (a) First, the composition of two unitary operators is again an unitary 
operator. So UT is an unitary operator. Since det(U) = det(T) = -1, 
we have 

det(UT) = det(U) det(T) = 1. 


This means that UT must be a rotation by Theorem 6.23. 
(b) It’s similar to the previous argumemnt. And now we have 
det(UT) = det(TU) = det(T) det(U) = 1-(-1) =-1. 
So they are reflections. 


25. By the proof of Theorem 6.23 we know that the matrix representations of 
T and U with repect to the standard basis a@ are 


cos2@ sin2¢ 
[T]a = Ge 2b  -cos 


and 
cos2y sin 2y 


[Ua = Gee ) ; 


So we have 


(OT la= WlalTle= (Soyo) coated) | 
Hence UT is a rotation by the angle 2(7- ¢). 
26. Here we have 
Ga eevee 


and 
_(cos2y  sin2y 


[U]a = Bae =) j 


(a) Compute 


[UT] =[UlalT]a= ee -%)  sin2(y- ) 


2 ) 
sin2(- 5) —cos2(%- ¢) 


So the angle is ¢) - g. 
(b) Compute 


cos2(y+$)  sin2(+ $) ). 


_ : ) 
[TU Ja ~ [TolV a = Fees 3) — cos 2(4) + ¢) 


So the angle is «+ ¢. 
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27. (a) We may write 


@ ob iO) 


Diagonalize the matrix and get 


where 


So we have 


(c) Diagonalize & | and get 
ve) 1 (2 3)\f{zx 
y} JIB\3 -2) Ny) 
(d) Diagonalize (' 5 and get 
w\ 1/1 1\fz 
Vp 2X) Ty 


(e) Diagonalize é 2) and get 


28. Denote (X’)' = (a’y’, z’). Then we have 
X! = PX, 


where P is the matrix in the solution of Exercise 6.5.2(e). 


199 


29. (a) We have the formula 


k 
valle = ve = We - Dd v; 


k (w Us k 
= Wk > ue 5) Uj = wre - >) (WK, Uz)U;- 
j=l Ia j=l 


(b) It directly comes from the formula above and some computation. 


(c) We have 
wy = (1, 1,0), we = (2,0, 1), wg = (2, 2,1) 
and 
(1, 1,0), v2 = (1,-1,1), v3 = (- pi 
v1 = = _ 
it 945U7), 92 ’ 94), 43 3° 3” 3 


by doing the Gram-Schmidt process. This means that we have 


12 2) f1 1 -3\f1 1 2 
10 2}=]1 -1 3 ]{0 1 3}. 
0117 \o 1 ¢/\0 0 1 


Then we may also compute 
lel) = V2, |v2l| = V3, llesll = SE 


Now we have 


1 1 -1\(v2 0 0o\"/v2 0 Olja 1 2 
2/4 -<t- 21/00 473: 00 0 V3 Offo 1 2 
01 2}lo o @ 0 o }\o o 1 
3 3 V3 
1 1 1 3 
Wan MG Ye \ |e NE 
-|}% -- S=]/o0 v3 4 
Tes Bs ° 8 vA 
1 2 2 
Oe AO! Oro 
Here the we have : rr ; 
a Sar ae 
Q-|% "A Ve 
o tL « 
va VB 
and : 
V2 V2 22 
R=|0 V3 4 
0 0 
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(d) First that Qi, Q2 and Ri, Ry are invertible otherwise A cannot be 
invertible. Also, since Qi, Qz is unitary, we have Qj = Qj! and 
Q3 = Q3'. Now we may observe that Q:Q3 = R2Rj' is an unitary 
matrix. But R2Rj' is upper triangular since Ry and the inverse of an 
upper triangular matrix R, are triangular matrices. So D = R2Rj! 
is both upper triangular and unitary. It could only be a unitary 
diagonal matrix. 
1 
Denote b by | 11 |. Now we have A = QR=b. Since Q is unitary, we 
-1 
have R = Q*b. Now we have 


Oe ae: 3508 


1 % = 
0 V3 Z)=R=Q*b= a 
OF o% “2 


— 
fo) 
we 


Then we may solve it to get the answer x = 3, y=—5, and z = 4. 


30. We may write 
B = {v1,02,.--,Un} 
and 
y = {u1,U2,...,Un}. 


We have that Q = [I ie Now if @ is orthonormal, we may compute 


n 
= uy = D1 Qijvi- 
i=l 
Thus we know that the inner product of u, and u; would be 


(us, Ut) = oy QisYi; x QitVi) 
ial i=l 


= y QisQiz, 
i=1 


the value of inner product of the s-th and the ¢-th columns of Q. So it 
would be 1 if s = ¢ and it would be 0 if s + t. Finally the converse is also 
true since Q* = [I]} is also an unitary matrix. 


31. (a) Check that 
Al, (a+ cy) =x+cy-2(x+cy,u)u 


= (a — 2(x,u)u) + c(y - 2(y, u)u) 
= H,(x) +cH,(y). 
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(b) Compute 
H,(«) - x = -2(a,u)u. 


The value would be zero if and only if x is orthogonal to u since wu is 
not zero. 


(c) Compute 
H,(u) = u-2(u,u)u = u- 2u = -u. 


(d) We check H* = H,, by computing 
(x, A (y)) = (Hu(@),y) = (@ — 2(ar, u)u) 


a (x,y) = 2(x, u) - (y,u) 
and 
(v, Hu(y)) = (vy - 2(y, u)u) 


= (z,y) 2(x,u) ‘ (y,u). 


Also, compute 
Hy (a) = Hy(a - (a, u)u) 


Hi, (x) — 2(x,u) Hy, (u) 
(a — 2(x,u)u) + 2x, u)u =z. 


Combining H,, = H* and H? =I, we have H,,H* = H*H,, =I and so 
fT, is unitary. 


32. (a) We pick 6 to be the value (x,y). Thus we have 


wa 


(x, Oy) 7: (x, (x,y)y) 


= (y,x)(x, y) = (x, y){y, 2) 
= (x, (x, y)y) = (x, Oy) 


and so the value is real. Now pick u to be the normalized vector 


Tee: And compute 
x-Oy . «-Oy 
Bale) = 9 Kee 2 eT ou Ta Bu 
2 
0 0 
2||x|? - 2(a, 6y) 
(a - 6y) 
|x - Oy||? 
|x|)? - (x,y) + yl? 
(a - 6y) 
x - Oy||? 
=“2-x+ Oy = by. 
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(b) Pick u to be the normalized vector —“. Compute 
zy] 


Hi, (x) =x -2(x,u)u= a -2(2, a ) ca 


=o) eal 
fel? - 2(c,y) 
= eae 


Jc? = (x, y) + Ewe 
|z—yl? 
=U-XU+Y=Y. 


y) 


6.6 Orthogonal Projections and the Spectral The- 
orem 


1. (a) No. Orthogonal projection is self-adjoint by Theorem 6.24. But 
for general projection the statement is not true. For example, the 
transformation T(a,b) = (a+ 6,0) is a projection which is not self- 
adjoint. 


(b) Yes. See the paragraph after Definition of “orthogonal projection”. 
(c) Yes. This is the result of the Spectral Theorem. 


(d) No. It’s true for orthogonal projection but false for general pro- 
jection. For example, the the transformation T(a,b) = (a+ b,0) isa 
projection on W. But we have T(0, 1) = (1,0) is not the point closest 
to (0,1) since (0,0) is much closer. 


(e) No. An unitary operator is usually invertible. But an projection is 
generally not invertible. For example, the mapping T(a, b) = (a,0). 


2. We could calculate the projection of (1,0) and (0,1) are 


(C1, 0), 1,2) 
14, 2)1/? 


(1,2) = $(1,2) 
and 
((0,1),(1,2)) 
Ia.0P 


by Theorem 6.6. So we have 


(1,2)= £(1,2) 
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On the other hand, we may do the same on (1,0,0), (0,1,0), and (1,0, 0) 
with respect to the new subspace W = span({(1,0,1)}). First compute 


((1,0,0), (1,0, 1)) 
(1,0, 1) |? 


1 
(1,0; 1) a 5 (1,0, 1), 


((0, 1,0), (1,0, 1)) 
(1,0, 1)? 


(1,0, 1) > 0(1,0, 1), 


and 
((0,0,1),(1,0,1)) 
I|(1, 0, 1) |? 


Hence the matrix would be 


1 
(1,0, 1) = 9 (10, I): 


3. The first and the third step comes from the Spectral theorem and the 
fact that these matrices are self-adjoint or at least normal. So we only do 
the first two steps. Also, we denote the matrix E;; to be a matrix, with 
suitable size, whose ij-entry is 1 and all other entries are zero. Finally, 
it’s remarkble that that the matrices P and D are different from the each 
questions. They are defined in Exercise 6.5.2. 


(a) Let Az = P* E\,P and AL = P* EgoP. Then we have T3 a Las; 
T_1 = La, and 
jets (en ae 


(b) Let A; = P*E\,P and A; = P* EooP. Then we have T_; = La 
T; = La, and 


La = -1T_; + 1T;;. 


(c) Let Ag a P* Ey,P and A_1 = P* EgoP. Then we have Tz = Lag; 
T_y = La, and 
a eee Vee 


(d) Let Ay = P*E\,P and A_» = P* (E22 + E33)P. Then we have T, = 
La; T_2 = La, and 
La = 4Ty — 2T_. 


(e) Let Ay = P* Ey,P and Aj = P* ( Fo9+ E33)P. Then we have T4 = Lay, 
T, = La, and 
La =4T,+1T;. 


4. Since T is an orthogonal projection, we have N(T) = R(T)* and R(T) = 
N(T)*. Now we want to say that N(I-T) = R(T) = W and RU - 
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5. 


T) = N(T) = W* and so I -T is the orthogonal projection on W*. If 
xe N(I-T), we have x =T(x)€ R(T). If T(x) € R(T), we have 


(1-T)T (2) = T(x) -T?(x) = T(x) - T(x) = 0. 


So we have the first equality. Next, if (J -T)(#) « R(I -T) we have 


T(1-T)(a) = T(x) - T?(x) = T(x) - T(x) = 0. 


If « €e N(T) we have T(x) = 0 and so x = (I-T)(x) € R(I-T). So the 
second equality also holds. 


(a) Since T is an orthogonal projection, we may write V = R(T)@®R(T)*. 


wa 


So for each x € V we could write x = u+v such that ue R(T) and 
vé R(T)+. So we have 


(w+ v)]] = full s fut of = lel. 


The example for which the in which the inequality does not hold is 
T (a,b) = (a+ b,0), since we have 


ITC, 1)] = (2,0) = 2> 10, Il = v2. 


Finally, if the equality holds for all z ¢ V, then we have ||w/| = |u+ ll. 
Since u and v are orthogonal, we have 


jut ul? = ul? + Jol? 


So the equality holds only when v = 0. This means that x is always an 
element in R(T’) and so R(T) = V. More precisely, T is the idenetity 
mapping on V. 

If T is a projection on W along W’, we have V=W @W’. So every 
vector x € V could be written as x = u+v such that ue W andve W’. 


If W' + W+, we may find some ue W and ve W’ such that they are 


2 
not orthogonal. So (u,v) is not zero. We may pick t = Toa and 


calculate that 
| (tu + v) |? = |tull?. 


But now we have 
tu + v|? 7 tu? + 2Re(tu, v) + ale 
= ||tul|? — Jul? < |T(tu+ v)|?. 


So T must be an orthogonal projection. 


6. It’s enough to show that R(T)* = N(T). If «¢ R(T)*, we have 


(T(a),T(a)) = (x, T° T(a)) = (x, T(T"())) = 0 
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and so T(x) =0. If x ¢ N(T), we have 


(x, T(y)) = (T" (x), y) = 0 
since T*(a) = 0 by Theorem 6.15(c). Hence now we know that R(T)* = 
N(T) and T is an orthogonal projection. 
7. (a) It comes from Theorem 6.25(c). 
(b) If 


k 
Ty = T” = DPT}. 
i=1 


Now pick arbitrary eigenvector v; for the eigenvalue \;. Then we 
have : 
0 = T(u;) = Ce NET) (v4) = Ne U;. 
i=l 
This means that A? = 0 and so 4, = 0 for all 7. Hence we know that 


(c) By the Corollary 4 after the Spectral Theorem, we know that T; = 
g:(T) for some polynomial g;. This means that U commutes with 
each T; if U commutes with T. Conversely, if U commutes with each 


T;, we have 
TU = (SATU = SLU 
; i=l ; i=l 
=) MUTT; = UD) Ti) = UT. 
i=l i=l 
(d) Pick 
u-> dT, 


1 
where A? is an arbitrary square root of );. 


(e) Since T is a mapping from V to V, T is invertible if and only if 
N(T) = {0}. And N(T) = {0} is equivalent to that 0 is not an 
eigenvalue of T. 

(f) If every eigen value of T is 1 or 0. Then we have T = 07) +17) =, 
which is a projectoin. Conversely, if T’ is a projection on W along 
W’, we may write any element in V as u+v such that ue W and 
véeW’. And if A is an eigenvalue, we have 


u=T(utv) =X(ut+v) 


and so 
(1-A)u= dv. 


Then we know that the eigenvalue could only be 1 or 0. 
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(g) It comes from the fact that 


k 
T* = N. 
i=1 


8. It directly comes from that T* = g(T) for some polynomial g. 


9. (a) Since U is an operator on a finite-dimensional space, it’s sufficient 
to we prove that R(U*U)* = N(U*U) and U*U is a projection. If 
xe R(U*U)"*, we have 


(x, U°U(a)) = (U(2),U(2)) = 0 
and so U*U(x) = U*(0) =0. If xe N(U*U), we have 
(x,U*U(y)) = U'U(«),y) = 0 


for all y. This means that « « R(U*U)*. Now we know that V = 
R(U*U)® N(U*U) and we can write element in V as p+q such that 
pe R(U*U) and N(U*U). Check that 


U*U(p+q) =U*U(p) =p 


by the definition of U* in Exercise 6.5.30(e). Hence it’s an orthogonal 
projection. 


(b) Use the notation in Exercise 6.5.20. Let 
a= {v1, V2,---, UK} 


be an orthonormal basis for W. Extend it to be an orthonormal basis 


4 = {v1,V2,---,Un} 
for V. Now check that 
UU*U(v;) = 0 = U(v;) 
ifi>k and 
UU*U(v;) = UU*(U(v;)) = U(x) 


if i< k by the definition of U* in Exercise 6.5.20(e). They meet on a 
basis and so they are the same. 


10. We use induction on the dimension n of V. If n = 1, U and T will be 
diagonalized simultaneously by any orthonormal basis. Suppose the state- 
ment is true for n < k-1. Consider the case n = k. Now pick one arbitrary 
eigenspace W = E) of T for some eigenvalue \. Note that W is T-invariant 
naturally and U-invariant since 


TU(w) =UT(w) = U(w) 
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for allwe W. If W = V, then we may apply Theorem 6.16 to the operator 
U and get an orthonormal basis ( consisting of eigenvectors of U. Those 
vectors will also be eigenvectors of T. If W is a proper subspace of V, 
we may apply the induction hypothesis to Ty and Uy, which are normal 
by Exercise 6.4.7 and Exercise 6.4.8, and get an orthonormal basis , for 
W consisting of eigenvectors of Tw and Uw. So those vectors are also 
eigenvectors of T and U. On the other hand, we know that W* is also 
T- and U-invariant by Exercise 6.4.7. They are also normal operators by 
Exercise 6.4.7(d). Again, by applying the induction hypothesis we get an 
orthonormal basis 62 for W* consisting of eigenvectors of T and U. Since 
V is finite dimentional, we know that 6 = 6; U 62 is an orthonormal basis 
for V consisting of eigenvectors of T and U. 


11. By Theorem 6.25(a), we may uniquely write element in the space as 
V=az,tatgt...+ Lp 
such that x; « W;. If 7 #7, we have 
T;T;(v) = 0 = 6ijTi(v) 
by the definition of T;’s and Theorem 6.25(b). Similarly, if i = 7, we have 
T;T,(v) = 2; = O47; (v). 


So they are the same. 


6.7 The Singular Value Decomposition and the 
Pseudoinverse 


1. (a) No. The mapping from R? to R has no eigenvalues. 


(b) No. It’s the the positive square root of the eigenvalues of A* A. 
For example, the singular value of 2[2 is 2,2 but not eigenvalues 
of (21)* (21), which is 4,4. 


(c) Yes. The eigenvalue of A* A is o?. And the singular value of cA is the 
positive square root of the eigenvalue of (cA)*(cA) = |c\?A* A, which 
is |c\?o?. So the singular value of cA is |cc. 


(d) Yes. This is the definition. 


(e) No. For example, the singular value of 2/5 is 2,2 but not eigenvalues 
of (21)*(21), which is 4,4. 


(f) No. If Ax = 6b is inconsistent, then A‘b could never be the solution. 
(g) Yes. The definition is well-defined. 


208 


2. For these problems, choose an orthonormal basis a, usually the standard 
basis, for the inner product space. Write down A = [T].. Pick an or- 
thonormal basis 8 such that [T*T], is diagonal. Here the order of 8 
should follow the order of the value of its eigenvalue. The positive square 
roots of eigenvalues of A*A is the singular values of A and T. Extend 
T(G) to be an orthonormal basis y for W. Then we have 


B= {U1,U2,+++,Un} 


and 
. i {U1,U2,..-, Um}. 


(a) Pick @ to be the standard basis. We have 


B= {(2,0); (0, 1)}, 


r= (Z01y, 501-0, 244) 
and the singular values are V3, V2. 
(b) Pick 
asthe ofa y/se = yar" 1)}. 
We have 
B={fs, fi, fat, 
y= thi, fet, 
and the singular values are V45. 
(c) Pick 
1 I. 1 
a={fi= Tag P= foe aR 
We have 
B={ fa, fa, fi}; 
1={ZeOh+ fs) Oh dof 


and the singular values are V/5, /5, V4. 
(d) Pick a to be he standard basis. We have 


1 : 2 i+] 
p= (harp /2a- Sh 


1 ; 2 i+] 
ye thairy (2a), 


and the singular values are 2,1. 
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3. Do the same to Ly as that in Exercise 6.7.2. But the a here must be the 
standard basis. And the matrix consisting of column vectors § and y¥ is 
V and U respectly. 


(a) We have A* A = & a} So its eigenvalue is 6,0 with eigenvectors 


iL 1 
B= yuna oie mea 


Extend L.4(8) to be an orthonormal basis 


1 
(1,1, a 1,0), 


S|- 


1 
=A 0 0 
(b) We have 
1 0 0 
Lit, v2 0 0 
v= | ).2=[ J.v= G0. 4 
VaKt 2h 0 v2 0 01.0 
(c) We have 
2 3 
vo Oi UP ae V5 0 
1 1 a 1 1 1 1 1 
u= | YS Bln] 9 g|.v-a(t 4). 
2| V0 2 3 15 0) VEAL ood 
be ey ae ng 0 0 
V10 V3 VI5 
(d) We have 
ab tae 76 170. 26 
SO i ea le ee 
Bw VW 2 vi “vi 
(e) We have 
. : V2 1 
yal 1+a al Se V6 0 v-l3 @ 
Q\1-i 1+i}? 0 of? aL =il 
v6 V3 
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0 V6 0 0 


ro 
oreo 
oor © 


V2 
4. Find the singular value decomposition A = UXV*. Then we have W = 
UV* and P=VdV*. 


(a) We have 


(b) We have 


1 0 0 20 4 0 
W=10 0 1],P=|4 20 O}. 
010 Oo: A 


5. Use the notation in Exercise 6.7.2. Then we have 


Ty) = YS tyan)er 


O74 


1 +%Q+%3 Lo-X3 


T' (21,22, 23) = ( 3 ’ 2 


). 


(b) 
T' (a+ ba + cx?) = a) eae 
45 
(c) r | 7 
Tt(a+bsinz +.ccos2) = a (2b+ c)sinx + (-b + 2c) cose 
2 5 
(d) 


T' (2, 22) = 5(a + (1 = i) Zo, (1 +1)22). 


6. Use Theorem 6.29. So we compute At = VNTU*. 


(a) : 
di 5a “roa 
Ar=2(1 1 a, 


(b) 


(d) 
ie. ho Bi 
(yooh, SS a 
sc [i ne 1} 
3 3 3 
(e) 
1fl-7 147 
(f) 
She a 
§ 6 6. 
i ? 
i 
6 6 6 


7. Use the Lemma before Theorem 6.30. We have Z, = N(T)* and Z, = 
R(T). 


(a) We have Z; = span{v1, v2} = V and Z = span{ur, uz}. 
(b) We have Z; = span{v;} and Z = span{uy}. 

(c) We have Z, = Z2=V =W. 

(d) We have Z; = Z, = C?. 


8. If the equation is Az = b, then the answer is A'b. 
(a) The solution is 
dl 
(21,22) = (5 5) 
(b) The solution is 
1 1 
(#1, £2, £3, £4) 3 (510, 1, 5) 
9. (a) Use the fact that 
is hi | ty “Ak. Ges 
(T (u)v={ (u;,0) if g>T, 
= 01505 
for i<r. We know that T*(u;) = 0;v;. And so 
TT* (us) = T (0404) = 070; 
for 7 <r. Similarly, we know that, for 7>r, 
(T* (uz), v5) = 0. 
Hence TT*(u;) = 0 when i> r. 
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10. 


11. 


12. 


13. 


14. 


(b) Let T = L4 and use the previous argument. 
(c) Use Theorem 6.26 and Exercise 6.7.9(a). 
(d) Replace T by A. 


Let 6’ and 7 be the standard bases for F™ and F” respectly. Thus we 
have [L4]}, = A. Also, let 
B= Lis Voyad strat 


and 
y = {u1,U2,-.-,Un}. 


By Theorem 6.26 we know that [La] =X. Apply Exercise 2.5.8 we have 
A=(La]}, = (11) [La] }[1]5, = USV"*. 
(a) Since T is normal, we have that T*(x) = Ax if T(a) = Ax. Hence we 
know that the eigenvalues of T*T are 
|Ar}?, |Aal?, .-.5 AnI?- 
So we know the singular values are 
[Ai], [Aa], -- + [Anl- 
(b) Replace T by A. 


Let 0; = 


Dal” Now we know that 


Av; = AGU; -_ O;|A;|v;. 
This means that AV = UX and so A=UXV*. 


If A is a positive semidefinite matrix with eigenvalues ;’s, we know that 
A* = A and so 4; is real and nonnegative. Furthermoer, we have 


A* A(x) = A?(x) = A2a. 


Since \; > 0, we have \/A? = A;. Hence the eigenvalues of A are the singular 
values of A. 


Consider 
A? = A*A=VOU*UXV* = VEPV* = (VEV")?. 


Both of A and V=V* are positive definite, we know that A = VUV™* by 
Exercise 6.4.17(d). So we know 


VuV* =UxV". 
Since A is positive definite, we know © is invertible. Also, V is invertible. 


Hence we get U = V finally. 
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15. (a) Use the fact that 
A* A= P*W*WP =P? 


and 
AA* =WPP*W* =WP?W*. 


So A*A = AA* if and only if P? = WP?W*, which is equivalent to 
WP? = P?W. 


(b) By the previous argument we have A is normal if and only if 
P? =WP°W* = (WPW’*)’. 


Since P and W PW% are both positive semidifinite, again by Exercise 
6.4.17(d), we have the condition is equivalent to P = WPW* or 
PW =WP. 


16. Use the singular value decomposition A = UNV*. Let P = UNU* and 
W =UV*. Then we have A= PW and P is positive semidefinite and W 
is unitary. 


17. (a) We calculate (UT)! and U', T" separately. First we have 
UT (a1, 22) = U(a1,0) = (21,0) =T". 


So we compute 77 directly. We have N(T) is the y-axis. So N(T)+ 
is the z-axis. Since we have 


T(1,0) = (1,0), 


we know that 
T (10). = (1,0) 


and 
T' (0,1) = (0,0). 


Hence we have 
(UT)! (21,22) = T"(a1, 22) = 21 T'(1,0)+22T" (0,1) = 217"(1,0) = (21,0). 


On the other hand, we also have N(U) is the line span{(1,-1)}. So 
N(U)* is the line span{(1,1)}. Since we have 


U(1,1) = (2,0), 


we know that 1 
U"(1,0) = 50,1) 


and 
U'(0,1) = (0,0). 
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18. 


19. 


wa 


wa 


a 


aH 


Hence we have 
XX 


ale 


U' (a1, x2) = 2,U'(1,0) + x2U*(0, 1) = ( 2° 9 


Finally we have 


Ly Ly 


T'U' (21,22) Sri 2°92 


)= (5.0) # UT)!(a1,22). 


Let A = (; i} and B = E 4, By the previous argument, we 


1 

have Al = i :) and Bi = B. Also, we have AB = B and so 
2 

(AB)i = Bi=B. 

Observe that if A = UXV™* is a singular value decomposition of A, 


then GA = (GU)XV* is a single value decomposition of GA. So we 
have 

(GA)t = Val(GU)* = ATG". 
Observe that if A = UXV™* is a singular value decomposition of A, 


then AH = UXV* H** = UX(A*V)* is a single value decomposition 
of AH. So we have 


(AH)! = H*V>tU* = H* Al, 


The nonzero singular values of A are the positive square roots of the 
nonzero eigenvalues of A*A. But the eigenvalues of A*A and that 
of AA* are the same by Exercise 6.7.9(c). Hence we know that the 
singular value decomposition of A and that of A* are the same. 
Also, we have 

(A')* Ab = AA’ = AA*. 
Since AA’ is self-adjoint, its eigenvalues are always real. We get that 
if AA*(x) = Az, then we have 


(A‘)* A*(z) = AA*(2) = AF = AZ, 


here A means the matrix consisting of the conjugate of the entries of 
A. Hence the singular value of A‘ and that of A* are all the same. 
Let A = UXV* be a singular value decomposition of A. Then we 
have A* = VX*U*. So 


(A*)t = U(a*)TV* = Ut) *V* = (AT). 


(c) Let A = UXV* be a singular value decomposition of A. Then we 


have A‘ = (V*)'X'U®. So 
(at = (H)ENIVE = CUEN'V" = (ANE, 
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20. Let A= UV* be a singular value decomposition of A. Then we have 
O = A? =UXV*UYY, 


which means 


“uV*UL =O 


since U and V are invertible. Now let {a;}/_, is the set of those singular 
values of A. Denote D to be the diagonal matrix with Dj; = + ifi<r 


while D;; = 1 if i>r. Then we have XD = DY = 1. This means that 


vty*Un! = DYV*UND = DOD=O. 


Now we have 
(At)? = (VatU*)? = Votu*Votu* 


=V(x'Vv*U>')*U* = VOU" =O. 
21. Here we use the notation in 6.26. 


(a) Compute 
TT'T(w) = TT" (ojuj) = T(v:) 


ifi<r, while 
TT'T(v;) = TT" (0) = 0=T(v%) 


ifi>r. 
(b) Compute 
T'TT'(u;) = TtT(L»,) = T'(u;) 
O% 
if i <r, while 
T'TT' (uj) = TIT(0) = 0 = TT(u;) 
ifi>r. 


(c) Pick an orthonormal basis a. Let A be the matrix [T], and A = 
UXV* be a singular value decomposition of A. Then we have 


(At A)* = (VitU*UdV*)* = (VitZV*)* = ATA 


and 
(Ady S(USVVEU*) SCE y= ast 


22. Observe that UT is the orthogonal projection on R(UT) by Theorem 
6.24 since it’s self-adjoint and UTUT = UT. We have that R(UT) = 
R(T*U*) c R(T"). Also, since 


UTT* (a) =T*U*T* (x) =T* (2), 
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23. 


we have R(T*) c R(UT) and hence R(T*) = R(UT). This means UT 
and T'T are both orthogonal projections on R(T*) = R(UT). By the 
uniqueness of orthogonal projections, we have UT = T'T. 
Next, observe that TU is the orthogonal projection on R(TU) by Theorem 
6.24 since it’s self-adjoint and TUTU = TU. We have that R(TU) c R(T). 
Also, since 

TUT(«) =T(2), 
we have R(T) c R(TU). By the same reason, we have TU = TT? and 
they are the orthogonal projection on R(T’) = R(TU). 


Finally, since we have TU — TT’ = Ty, we may write is as 

T(U -T*)=Tp. 
We want to claim that R(U-T')n N(T) = {0} to deduce that U-Tt = Tp. 
Observe that R(T') = N(T)* = R(T*). Also, we have R(U) c R(T*) 
otherwise we may pick x ¢ W such that U(x) « R(U)\R(T*) and get the 


contradiction that 
0+ U(x) = UTU(«) =0 


since UT is the orthogonal projection on R(T”). Now we already have 
R(U -T') c R(T*) = N(T). Hence the claim 


R(U -T')n N(T) = {0} 


holds. This means that T(U - T')(x) = 0 only if (U-T')(a) = 0. Since 
we have T(U-T") = Tp, now we know that actually U-T' = Ty and hence 
ert, 


Replace T by A. 


24. Replace T by A. 


25. 


(a) By Exercise 6.3.13 T*T is invertible. Let U = (T*T)7!T*. Check 
that 
GMOU Wee WO age Bp ead had eae 
and 
DTU Dy eRe Ty. f=, 
Also, both TU = T(T*T)!T* and UT = (T*T)"'T*T = I are self- 
adjoint. Apply Exercise 6.7.21 and get the result. 
(b) By Exercise 6.3.13 TT* is invertible. Let U = T*(TT*)~+. Check 
that 
LURSLE (PE) ch ar 
and 
OLU SEE Ee Ia yo au, 
Also, both TU = TT*(TT*)! = I and UT = T*(TT*)"'T are self- 
adjoint. Apply Exercise 6.7.21 and get the result. 
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26. 


27. 


By Theorem 6.26, we know [T ibe = ¥ for some orthonormal bases 3’ and 
+’, where © is the matrix in Theorem 6.27. In this case we know that 


((TYy,)t = ot = hye, 
Now for other orthonormal bases 8 and y. We know that 


Weal ake Baa 


is a singular value decomposition of [Z']} since both [J]?, and [I iG are 
unitary by the fact that all of them are orthonormal. Hence we have 


(fy Sie iaie =W Ar eine =e: 


Use the notation in Theorem 6.26. By the definition of T', we have 
TT! (a) = LL“\(a) = «. 


Also, if  € R(T)", again by the definition of T', we have TT'(x) = 0. 
Hence it’s the orthogonal projection of V on R(T). 


6.8 Bilinear and Quadratic Forms 


1. 


(a) No. A quadratic form is a function of one variable. But a bilinear 
form is a function of two variables. 


(b) No. We have 4I = (27)'I(2I). But 4/ and 2/ has different eigenval- 
ues. 


(c) Yes. This is Theorem 6.34. 
(d) No. See Example 5 of this section. The matrix bi | is a coun- 
terexample when F = Zo. 


(e) Yes. Let H, and Hz be two symmetric bilinear forms. We have 
(Ai + H2)(x,y) = Mi (x,y) + Ho(a,y) 
= Ay(y,x) + Ho(y,x) = (A, + Hy)(y, x). 
(f) No. The bilinear forms 


5. hte 0 6 {EA 
Ay(a2,y) = 2 F 1) a Faesy) = ( i) 


have matrix representations (; i) and ic ) respectly with the 


standard basis. But both of their characteristic polynomials are (1 - 
ame 
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2 


3. 


(g) No. We must have H(0,0) = 0 since H(2,0) is a linear function of x. 
(h) No. It’s n? # 2n for n #2 by the Corollary 1 after Theorem 6.32. 


(i) Yes. Pick a nonzero element u € V arbitrarily. If H(x,u) = 0, then 
we have y = u. Otherwise pick another nonzero element v € V such 
that {u,v} is independent. Thus we have 


y = H(2,v)u- H(a2,u)v #0. 
But we have 
H(2,y) = A(2,v)H(a,u) - H(a2,u)A(2,v) = 0. 
(j) No. It needs one more condition that H is symmetric. For example, 


1 
0 i has its congruent matrix 


(5 o}o=(5 allo able a)=(ce su): 


If that congruent matrix is diagonal, we should have bc = ad = 0. If 
a=b=0ora=c=0, then Q is not invertible. Similarly, it cannot be 
d=c=O0ord=b=0. So this bilinaer form is not even diagonalizable. 


the matrix ic 


The property 1 comes from the definition of a bilinear form. The property 
2 comes from that L,(0) = R,(0) = 0. The property 3 comes from the 
computation 


H(a+y,z+w) =HA(2,z+w)+A(y,z+w) 


= H(x,z)+H(2,w)+ A(y,z)+ Ay, w). 


Finally, since the conditions in the definition of a bilinear form is sym- 
metric for the first and the second component. So we get the property 
4. 


(a) Check that 
(A, + H2)(ax%, +22,y) = Hi (ar, + 22,y) + Ho(ar1 + x2,y) 


= aH (21,y) + Hy (2,y) + aHs(21,y) te Ap(#2,y) 
= a( A + H2)(x1,y) + (Ai + A2)(x2,y) 
and 
(Hy + H2)(x, ay1 + ye) = Ai (2, ays + yo) + Ho(2, ay + y2) 


= aH (x,y) + Hi(a,y2) + aHo(x,y1) + Ho(2, yo) 
= a(H, + H)(x,y1) + (Ay + Ae) (2, ye). 
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(b) Check that 
cH (ax + 22,y) = caH (21, y) + cH (x,y) 


= acH(21,y)+cH(ax2,y) 
and 
cH (x, ay1 + yo) = caH (x, y1) + cH (2, yo) 
=acH(2,y,) + cH (a, yz). 


(c) Pick Ho(x,y) = 0 as the zero element and check the condition for a 
vector space. 


4. (a) Yes. The form f x g is bilinear and the integral operator is linear. 
(b) No. If J(x,y) #0 for some x and y, then we have 
H(cx,y) = [J(ex,y)]° = [J (x,y)? # cH (2,9). 


(c) No. We have 
H(2,1) =4+#2H(1,1) <6. 


(d) Yes. The determinant function is an n-linear function and now n is 
2. 


(e) Yes. When the field if R, the inner product function is a bilinear 
form. 


(f) No. It fails when F = C. If we pick V = C and choose the standard 
inner product. Thus we have 


A(1,?) = (1,7) = -147H(1,1) =%. 
5. See the definition of the matrix representation. 


(a) It’s a bilinear form since 


ay by ay : 1 -2 0 by 
Hy} |a2],Jo.]}=]a2] Jl 0 0 J] be]. 
a3 b3 a3 0 0 -1 b3 


The matrix representation is 


0 2 -2 
2 0 -2]. 
1 1 O 


(b) It’s a bilinear form since 


ay 1001 by 
H ay ag by ba _ ag 0 0 0 0 ba 
a3 ag , b3 ba a a3 00 0 0 b3 


The matrix above is the matrix representation with respect to the 
standard basis. 
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(c) Let 
f =a, cost + ag sint + ag cos 2t + a4 sin 2t 


and 
g = 6; cost + bg sint + bg cos 2t + bg sin 2t. 


We compute that 

H(f,g) = (a2 + 2a4)(-bi - 463) 
a\'(0 0 0 0 
ag -1 0 -4 0 bg 
a3 0 0 0 O 
a4 -1 0 -4 0 b4 


Hence it’s a bilinear form. And the matrix is the matrix representa- 


tion. 
ay by ay : 0 1 by 
(()-C))-(e) (tole) 
Hence we find the matrix A. And the form 2! Ay is a bilinear form. 
(a) Check that 
T(H)(ax1,22,y) = H(T(ax1 + x2),T(y)) 
= H(aT(x1) + T(x2),T(y)) = aH (T(21),T(y)) + H(T(x2),T(y)) 
= aT(H)(21,y) +T(H)(22,y) 


and as 
T(H)(x, ay, + yo) = A(T(2,T (ays + y2)) 


= H(T(a),aT (yi) + T(y2)) = aH (T(x), T(y1)) + (T(x), T(y2)) 
= aT(H)(x,y1) + T(H) (a, y2). 
(b) Check that 
T (cH, + H2)(w,y) = (cH1 + H2)(T(«),T(y)) 
= cH, (T(x), T(y)) + Ho(T (2), T(y)) 
= [cT(H,) + T(H)](2,y). 


(c) Suppose T is injective and surjective. If H is an nonzero bilinear form 
with H(x1,y,) #0 for some x1, y, € W and T(H) is the zero bilinear 
form, we may find 29, yo € V such that T(29) = x, and T(yo) = 1 
since T is surjective. Thus we’ll have 


O= T(H)(xo, 21) = H(a,y) #0, 
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a contradiction. This means that T is injective. On the other hand, 
since T is an isomorphism, the inverse of T exists. Then for each 
H € B(V), we can define 


Ho(x,y) = Ho(T™'(x),T'(y)) 
such that _ 
T(Ho) = H. 
8. (a) Let 6 = {u;}. We know that (Wg(A))i; = H(vi,v;). So we have 
(we (cH + H2))i; = (cH, + H2)(v;, v;) 
= cH, (vj, 05) + H2(vi, 05) = e(a(Ai) aj + (Wp (Aa) az. 
(b) The form H’(u,v) := u'Av is a bilinear form when u,v ¢ F". We 
know that ¢g is an isomorphism from V to F”. This means that 
H = 631(H') 
is a bilinear form by Exercise 6.8.7. 
(c) Let 8 = {v;}. And let 


n n 
v= Yai, y = ye Uj. 
i=1 t=1 


Thus we have - hn 
(x,y) = iy, A,Vi, > bjv;) 
i=l 


i=l 
a aid; H (vi, v5) = [a(x] Aloa(y)]. 
iJ 

9. (a) It comes from the fact that dim(Mnyxn(F)) =n. 


(b) Let E;; be the matrix whose ij-entry is 1 and other entries are zero. 
Then we know that {£;,;};; is a basis in M,x,(F). Since wg is an 
isomorphism, the set We ({ Eig }s9) is a basis for B(V). 

10. The necessity comes from Exercise 6.8.8(c). For the sufficiency, we know 
that 
(wa(H))iz = H(i, vj) = ef Ae; = Ais, 


where vu;,v; are elements in 8 and e;,e; are the elements in the standard 
basis in F”. 


11. Pick @ to be the standard basis and apply the Corollary 3 after Theorem 
6.32. Thus we have [¢g(x)] = 2. 


12. Prove the three conditions. 


222 


reflexivity We have A is congruent to A since A= J‘ AI. 


symmetry If A is congruent to B, we have B = Q'AQ for some in- 
vertible matrix Q. Hence we know that B is congruent to A since 
A= (Q)¢AQ™ 

transitivity If A is congruent to B and B is congruent to C, we have 
B=Q‘AQ and C = P'BP. Thus we know that A is congruent to C 
since C = (QP)'A(QP). 


13. (a) If x is an element in V, then ¢,(x) and ¢g(z) are the y-coordinates 
and the 6-coordinates respectly. By the definition of Q, we have 


a(x) = Lag,(2) 
for allae V. 
(b) By the Corollary 2 after Theorem 6.32, we know that 
H(2,y) =[6y(«)]'b+()[$4(y)] = [¢8(#)]'¥a(A)[oa(y)]. 


By the previous argument we know that 


[¢,(2) ]'Q've(H)Q[¢,(y)] = [44(#)]'4,(A)[¢,(y)], 


where Q is the change of coordinate matrix changing y-coordinates 
to 6-coordinates. Again, by the Corrolary 2 after Theorem 6.32 we 
know the matrix Q'ys(H)Q must be the matrix ~(H). Hence they 
are congruent. 


14. Since they are congruent, we have 


Q'va(H)Q = b,(H) 


for some invertible matrix Q. But invertible matrix will preserve the rank, 
so we know their rank are the same. 


15. (a) If A is a square diagonal matrix, then we have A;,; = Aj; = 0. 


(b) If A is a matrix congruent to a diagonal matrix B, then we have 
B=Q'AQ and A=(Q™')'BQv!. This means A is symmetric since 


At = (Q)'BIQT = (QUY'BQ?= A. 


(c) Say a to be the standard basis and £ to be the basis in Theorem 6.35. 
Let H =~,'(A) be the bilinear form whose matrix representation is 
A. Thus we know that ~q(H) = A and 6(H) are congruent. Also, 
by Theorem 6.35 we know that wg(#) is diagonal. 


16. If K(x) = H(a,x), then we have 


K(a+y)=HA(e+y,c+y) = A(2,x)+2H (x,y) + A(y,y) 
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= K(x) +2H(x,y) + K(y). 


If F is not of characteristic two, we get the formula 
1 
H(z,y) = s[K(e+y) -K(z) - K(y)]. 


17. Use the formula given in the previous exercise to find H. To diagonalize 
it, we may use the method in the paragraph after Theorem 6.35. Here the 
notation a is the standard basis in the corresponding vector spaces. 


(a) We have 


(72) (os) = 5% (tas) -* (Ca) (i) 


= —2a 1b, a 2a bo + 2a9b, a Agbo. 


-2 
2 


Cr i)(2° alo )-(w 5) 


So the basis 8 could be 


Also, we have ¢g = ( i) Hence we know that 


{(1,0), (1, 1)}- 


(b) We have 
A ((“:) 5 (":)) = 7a,b 1 _ 4a; be _ 4agb, + agbo 
and 4 
B = {(1,0), (., 1)}. 
(c) We have 


ay bi 
A a2], bg = 34,6, + 3aqbo oF 3a3b3 = a,b3 - a3). 
a3 b3 


B = {(1,0,0), (0, 1,0), G0. 1}. 


and 


18. As what we did in the previous exercise, we set 


ty 
K (: = 3t7 + 3t3 + 3t3 - Qtits 
ts 
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and find 


ay by 
A a2], by = 3a1b1 +t 3a2b2 + 3a3b3 - a,b3 - a3b 1 
a3} \bs 


such that H(«,x) = K(«) and H is a bilinear form. This means that 


ty 3.0 -1\ (ti 
K|t2}=(t1 to ts)}0 3 0 ][te 
ts -1 O 3 t3 


1 1 1 1 
we 5 9 \ (A 0 Noe 8 “y3) (2 
=(ti t ts)] 0 0 17/0 2 O]J] 2 0  |/ te}. 
-- + o}\o 0 3/\o 1 Oo 
v2 V2 
Note that here we may diagonalize it in sence of eigenvectors. Thus we 
pick 
1 1 1 1 


Ja a 7a 


), (0, 1,0).} 


And take 


Thus we have 

4 0 0O\ /t) 
Bt] + 3t5 + 3t3-2tt-(t, th t5)[0 2 Of t) 
0 0 3) \ts 
= A(t))* + 2(t3)? + 3(t5)?. 
Hence the original equality is that 
A(ti)? + 2(t5)? + 3(t§)? + Lo.t = 0, 
where |.o.t means some lower order terms. Hence S is a ellipsoid. 


19. Here we use the noation in the proof of Theorem 6.37. Also, the equation 
me yl Lar 
Yo (=r - €)83 < f(x) < (EX + O87 
j=l 2 ist 2 


is helpful. 


(a) Since 0 < rank(A) <n and A has no negative eigenvalues, we could 
find a positive eigenvalue 4; of A. Then take x = s;v;. Then we'll 
have that 


f(a) > (5-982 > 0= f(0). 


We may pick s; arbitrarily small such that ||2—-p|| could be arbitrarily 
small. Hence f has no local maximum at p. 
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(b) Since 0 < rank(A) < n and A has no positive eigenvalues, we could 
find a negative eigenvalue A; of A. Then take x = s;v;. Then we'll 
have that 


fle)< GX +6)s? <0= f(0). 


We may pick s; arbitrarily small such that ||2—-p|| could be arbitrarily 
small. Hence f has no local minimum at p. 


20. Observe that D is the determinant of the Hessian matrix A. Here we 
denote 1, Az to be the two eigenvalues of A, which exist since A is real 
symmetric. 


(a) If D> 0, we know that \, and Az could not be zero. Since op) > 0, 


att 
2 
we have LP) > 0 otherwise we’ll have D < 0. Hence the trace of A 


is positive. Thus we have 
Aq + AQ = -tr(A) <0 
and 
Ay A2 =D>0. 


This means that both of them are negative. Hence p is a local mini- 
mum by the Second Derivative Test. 


(b) If D <0, we know that A; and 2 could not be zero. Since o £(p) <0, 
1 
o it de 0 otherwise we’ll have D > 0. Hence the trace of A 
2 
is negative. Thus we have 


we have 


M1 + Ag = —tr(A) > 0 
and 
Ay A2 =D<0. 


This means that both of them are positive. Hence p is a local maxi- 
mum by the Second Derivative Test. 


(c) If D <0, we know that A; and Az could not be zero. Also, we have 
AyA2 =D<0. 


This means that they cannot be both positive or both negative. 
Again, by the Second Derivative Test, it’s a saddle point. 


(d) If D = 0, then one of A, and Az should be zero. Apply the Second 
Derivative Test. 
21. As Hint, we know that 
E'A=(A‘E)’. 
That is, do the same column operation on At. This means that do the 
same row operation on A. 
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22. See the paragraph after Theorem 6.35. 


(a) Take 
1 O 1 -3 1 0 
e i) a(j et a 
(b) Take 
L Aya =P 72s. 0 
(ali ¥)-6 2) 
(c) Take 


1-4 2 1 0 0) /2 0 0 
O 1 OJAl-; 1 Of=[0 4 Of. 
Of 2 2 04% 0 0 -l 


23. Since each permutation could be decomposed into several 2-cycle, inter- 
changing two elements, we may just prove the statement when the per- 
mutation is 2-cycle. Let A be a diagonal matrix and B be the diagonal 
matrix obtained from A by interchanging the it-entry and the 7j-entry. 
Take E’ be the elementary matrix interchaning the i-th and the j-th row. 
Then we have FE is symmetric and EAE = B. 


24. (a) Compute that 
A(ax, + ©2,Y) = (ary + £2,T(y)) 
= a(r1,T(y)) + (x2, T(y)) 
=aH(x1,y) + H(x2,y) 
and 
A(x, ayy + yo) = (x, T (ay + yo)) = (2, aT (a1) + T(x2)) 
= a(x,T(y1)) + (x, T(y2)) 
= aH(x,y1) + A(x, y2). 
(b) Compute that 
Ay, x) = (y,T(«)) 
=(T(x),y) = (a, T"(y)). 
The value equal to H(x,y) = (x,T(y)) for all x and y if and only if 
eee ae 
(c) By Exercise 6.4.22 the operator T must be a positive semidifinite 
operator. 


(d) It fail since 
(a, ty) = (x,T(iy)) = (x,iT(y)) 
= -i(x,T(y)) + iH (x,y) 


in genereal. 
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25. Let A = 18(H) for some orthonormal basis 3. And let T be the operator 
such that [T]g = A. By Exercise 6.8.5 we have 


H (x,y) = [¢a(2)]'A[oa(y)] = [46(2) "TT ]alea(y)] = [oa(x)]'Loa(T(y))I- 
Also, by Parseval’s Identity in Exercise 6.2.15 we know that 


(x,T(y)) = a x, vii T(y), vi) = [¢a(2)]}'[de(T(y))] 


since § is orthonormal. 


26. Use the Corollary 2 after Theorem 6.38. Let p, g be the number of positive 
and negative eigenvalues respectly. Then we have 


ptqsn. 


Hence we have 


Cres ee wear) 


possibilities. 


6.9 Einstein’s Special Theory of Relativity 


1. (b) It comes from that T,(e;) =e; for i = 2,3. 
(c) By the axiom (R4), we know that 


ti 


a a 
0 0 
Ty Oo} | 0 
d d' 


(d) We compute that, for i = 2,3 and j =1,4, 
(Ty, (ei), €9) = (eas To(e3)) = 
by the fact that T,,(e;) € span({e1, e4}). Hence we know that span({e2, e3}) 
is T-invariant. 
On the other hand, we compute that, for 1 = 2,3 and 7 = 1,4, 
(Ty (e;), a) = (3, Tu (es) = (ej, €4) = 0. 
So span({e1,e4}) is T7-invariant. 


2. We already have that 
(T, LaT,(w),w) = 0 


if (L4(w), w) = 0 for w € R* whose fourth entry is nonnegative. Now if we 
have (L4(w),w) = 0 for some w is a vector in R* whose fourth entry is 
negative, then we have 


0 = (T" LaT,(-w),-w) = (-1)?(T* LaT, (w), w) =(T7 LaT,(w),w). 
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3. (a) The set {w1, w2} is linearly independent by definition. Also, we have 
wi = e, + e4 and we = e; - eg are both elements in span({e1, e4}). 
Hence it’s a basis for span({e1,e4}). Naturally, it’s orthogonal since 
(wi, We) =0. 


(b) For brevity, we write W = span({e1,e4}). We have T,(W) c W and 
T;(W) c W by Theorem 6.39. Also, W is L4-invariant if we directly 
check it. Hence W is Ty D4T,-invariant. 


4. We know that BY AB, = [TZ LaTy]g and [L4]g. So (a) and (b) in the 
Corollary is equivalent. We only prove (a) by the steps given by Hints. 
For brevity, we write U =T7 DaT, and C = By AB,. 


(a) We have U(e;) = e; for 1 = 2,3 by Theorem 6.39. By Theorem 6.41 
we have 


U(e1) + U(eq) = U(e1 + 4) = U(w1) = awe = aez - aea, 
U(e,) - U(eq) = U(e1 — €4) = U(we) = bw, = be, + bea. 


Solving this system of equations we get that 
U(e1) = pei - qea, 
U(e4) = ge1 — peo, 


where p = ab and q = a Write down the matrix representation of 


U and get the result. 
(b) Since C is self-adjoint, we know that q =-q and so q=0. 


(c) Let w = e2+e4. Then we know that (L4(w),w) = 0. Now we calculate 
that 
U(w) = U(eg + €4) = €2 — peg. 


By Theorem 6.40 we know that 
(U(w), w) = (e2 - pea, e2 + ea) = 1-p=0. 
Hence we must have p = 0. 


5. We only know that 


0 vt" 
0 0 
Ty Oo}; | O 
1 $l 
for some t” > 0. Compute that 
0 0 vt" -vt" 
“3 0 0 0 0 
(e LaT, ol’lo = (La 0 ) 0 s (t")?(v? - 1) 
1 1 t" tl" 
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0 0 0 0 
: o} fol_,, fo} fof. 
(T? LaT, ol’lo =(L4 ol’lo )= 1 
1 a 1 1 


HW\2 (1,2 eS an cL: 
by Theorem 6.41. Hence we have (t”)*(v* - 1) = -1 and so t” = a 


. Note that if Sy is moving past S$; at a velocity v > 0 as measured on S, 
the 7}, is the transformation from space-time coordinates of S; to that of 
Sy. So now we have 

Ty, 2 SS, 

TE, :S's ue 

Ty, 1 SS". 


The given condition says that T,,, = T,,Ty,. This means that 


1 —v2 1 U1 
Ra ghee ane gee 
0) 1 0 0 0 1 0 0 
Pigh tS 0 0 1 0 0 QO. <1 0 
-—v2 1 V1 1 
1-v3 va 1-v2 \/ 1-0? Ve 1-v? 
1l+v2v1 0 0 —V2Q-U1L 
J G-02) (1-0?) J a-02)(1-0?) 
- 0 1 0 0 = 
~ 0 0 1 0 << e 
Vv2-U1 0 0 1+v20v1 


Hence we know that 
1+v2v1 Set 
V0) 1-0?) Viv} 
Japan Vv er , 


By dividing the second equality by the first equality, we get the result 


U1 + V2 


U3 = . 
1+ U1 V2 


. Directly compute that (B,)~' = Bc-v). If.$" moves at a negative velocity v 
relative to S. Then we have S moves at a positive velocity v relative to S$”. 
Let T, be the transformation from S' to $’. Then we have [T;']4] = By-v) 
and so [Ty]¢ = By. 


. In point of view of Earth, the astronaut should travel 2 x 99/0.99 = 200 
years. So it will come back in the year 2200. However, let T,, and T_, be 
the transformation from the space on Earth to the space of the astronaut 
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in the tour forward and the tour backward respectly. We may calculate 


that 
99 0 0 
T 7 a 0 ol 
sal ie V1— 0.992 0 ~{ 0 7 
100 100-99 0.99) \14.1 


Hence the astronaut spent 14.1 years to travel to that star measured by 
himself. Similarly, we may compute that 

99 0 0 

0 1 0 0 

0 


ce a 
0 V1-0.992 0 
Sat 


—100 -100 + 99 x 0.99 


Hence he spent 14.1 years to travel back in his mind. Combining these 
two, he spent 28.2 years. Hence he would be return to Earth at age 48.2. 


We 


9. (a) The distance from earth to the star is b as measured on C. 


(b) At time ¢, the space-time coordinates of the star relative to S’ and 


C"’ are 
b b-vt 
Pe 0 
“10 JS1—- v2 0 
t 


t— bu 


(c) Compute 2’ + t’v to eliminate the parameter t by 


b(1—v?) = bV1- v2. 


1 
ze’ -t'v= 
V1—-v?2 


Hence we get the result. 

(d) i. The speed that the star comes to the astronaut should be 
da’ | 

dt! 

Hence the astronaut feel that he travels with the speed v. 

ii. In the astronaut’s mind, he leave earth at t’ = 0. Hence in S the 


earth is at bV1—-v?. 


6.10 Conditioning and the Rayleigh Quotient 


2 
1. (a) No. The system & | is well-conditioned but its condition number 


is k, which can be arbitrarily large. 
(b) No. This is the contrapositive statement of the previous question. 
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(c) Yes. This is the result of Theorem 6.44. 
(d) No. The norm of A is a value but the Rayleigh quotient is a function. 
(e) No. See the Corollary 1 after Theorem 6.43. For example, the norm 


of is | is 1 but the largest eigenvalue of it is 0. 


2. Let A be the given matrix. Use the Corollary 1 to find the norm. 


(a) The norm is V18. 
(b) The norm is 6. 


V177+15 


(c) The norm is 73 


3. If B is real symmetric, then we have B*B = B?. If X is the largest 
eigenvalue of B, then we have )? is also the largest eigenvalue of B?. 
Apply the Corollary 1 after Theorem 6.43 and get that ||Bl| =A. If B is 
not real, the eigenvalue of B may not be a real number. So they are not 
comparable. Hence we need the condition that B is a real matrix here. 


4. (a) Use the previous exercise. We know that |Al] = 84.74 and ||A7'| = 


TOOEEE. am ~ 17.01. And the condition number is the ratio 
4.74 
cond(A) = asic ~ 1443. 
0.0588 


(b) We have that 
|z- Ab] = || A-*(b- AZ)|| < | At || |b - AZ| = 17.01 x 0.001 = 0.017 


and 
|Z - Aol na(A) [22h [do] 14.43 
Ato)” Jo] ~ lel 
by Theorem 6.44. 
5. Use Theorem 6.44. Thus we get 
loa |b] _ 0.1 
ond( A) 100 x =10 
Tel © ta 1 
and 
|| 1 ||| 1 O1 


= 0.001. 


> = x 
|| ~ cond(A) |b] 100 1 
6. Let x = (1,-2,3). First compute that 
(Bx, x) 
|x|? 
_ ((3,0,5),(1,-2,3)) — 9 
14 7 


R(x) = 
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10. 
11. 


Second, compute the eigenvalues of B to be 4,1,1. Hence we have ||B|| = 4 
and ||Bu!|, =1-! =1. Finally calculate the condition number to be 


cond(B) =4x1= 4. 


. Follow the proof of Theorem 6.43. We have 


bey rilas|? An pee |a;|? 7 An |||? = 


R(x) = > 
I? alle alle 


Ne 


And the value is attainable since R(v,) = An. 


. Let \ be an eigenvalue of AA*. If A = 0, then we have AA* is not invertible. 


Hence A and A* are not invertible, so is A*A. So A is an eigenvalue of 
A* A. 

Suppose now that A + 0. We may find some eigenvector x such that 
AA*x = Xx. This means that 


A* A(A* x) = \A* a. 


Since A*z is not zero, \ is an eigenvalue of A* A. 


. Since we have Ada = 6b and x = A-'b, we have the inequalities 


5] 
|x|] = 
[Al 


and 
jx < JA TLE. 
Hence we get the inequality 


Joel) oot 1 del 
lel JAN A HOE PAT At al 


This is what we proved in the previous exercise. 


If A= kB, then we have A* A = k?J. So all the eigenvalues of A* A are k?. 
Thus we have || Al] = & and ||A~!|| = k-t. Hence the condition number of A 
iskek =. 

Conversely, if cond(A) = 1, we have \; = A, by Theorem 6.44. This means 
that all the eigenvalues of A*A are the same. Denote the value of these 
eigenvalue by k. Since A” A is self-adjoint, we could find an orthonormal 
basis 3 = {v;} consisting of eigenvectors. But this means that 


A* A(v;) = kv; 


for all i. Since @ is a basis, we get that actually A*A = kI. This means 
that B= aA is unitary of orthogonal since B* B = J. Thus A is a scalar 


multiple of B. 
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12. (a) If A and B are unitarily equivalent. We may write B = Q* AQ for 
some unitary matrix Q. Since Q is unitary, we have |Q(x)| = |<. 
So we have 
[Bz] _ |Q*AQz| _ | AQz| 
Isl el sa 
Since any unitary matrix is invertible, we get the equality || Al] = | Bl]. 
(b) Write 


and 


We observe that 
2 2 2 
aI" = (2,2) =D) a; = Iba(2)I", 
i=l 
where ¢3(xz) means the coordinates of « with respect to 6. So we 
have 


zl] = Iee(x)| 


This means that 


TO) ay Walt (@))I 
ic] = Jos(o)l 


IZ ]e¢a() | 


== max 
¢a(2)+0 || da(x)| 


(c) We have ||T'| > & for all integer k since we have 


|T"| = max | 
x+0 


— 


= [7 ]el- 


T(x) _ [evel _ 
lel Teal 


13. (a) If A; is the largest eigenvalue of A* A, then we know that o; = V/A; = 
|All 


(b) This comes from that the nonzero singular values of Al are 


= = 
O, 207.1 2°°20,- 


(c) If A is invertible with the largest and the smallest eigenvalues of A* A 
to be Ay and \,, > 0, we know that a, = WA, and on = VAn. Hence 

we have e 
cond(A) = —. 


on 
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6.11 The Geometry of Orthogonal Operators 


1. 


(a) 


(i) 


No. It may be the compose of two or more rotations. For example, 
T(2,y) = (Ro(x), Re(y)) is orthogonal but not a rotation or a reflec- 
tion, where x,y € R? and Rg is the rotation transformation about the 
angle @. 

Yes. See the Corollary after Theorem 6.45. 


Yes. See Exercise 6.11.6. 


) No. For example, U(x,y) = (Re(x),y) and T(x, y) = (x, Re(y)) are 


two rotations, where x,y ¢ R? and Rg is the rotation transformation 
about the angle 0. But UT is not a rotation. 


) Yes. It comes from the definition of a rotation. 


) No. In two-dimensional real space, the composite of two reflections 


is a rotation. 


No. It may contains one reflection. For example, the mapping T() = 
-x could not be the composite of rotations since the only rotation in 
R is the identity. 


No. It may be the composite of some rotations and a reflection. 
For example, T(x,y) = (-2, Ro(y)) has det(T) = -1 but it’s not a 
reflection, where x € R, y € R?, and Rg is the rotation about the angle 
0. 


Yes. Let T be the reflection about W. We have that W@W*+ =V. So 
one of W and W* could not be zero. But every nonzero vector in W 
is an eigenvector with eigenvalue 1 while that in W* is an eigenvector 
with eigenvalue -1. So T’ must have eigenvalues. 


No. The rotation on two-dimentional space has no eigenvector unless 
it’s the identity mapping. 


2. By Exercise 6.5.3 we know that the composite of two orthogonal operators 


3. 


4. 


(a) 
(b) 


(c) 


(a) 


should be an orthogonal operator. 


Check that A* A = AA* =I. So A is orthogonal. Hence it’s a reflec- 
tion by Theorem 6.45 since its determinant is -1. 


Find the subspace {x : Av = x}. That is, find the null space of A- I. 
Hence the axis is 


span{(V3, 1)}. 


Compute det(B) = -1. Hence we have det(AB) = det(BA) =1. By 
Theorem 6.45, both of them are rotations. 


Compute that 
det(A) = — cos” ¢- sin? ¢ = -1. 


By Theorem 6.45, it’s a reflection. 
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(b) Find the subspace {x : Ax = x}. That is, find the null space of A- J. 
Hence the axis is 


span{(sin ¢, 1 - cos ¢) =}. 
5. Let a = {e1,€2} be the standard basis in R?. 


(a) We may check that the rotation Ty is a linear transformation. Hence 
it’s enough to know 


T(e1) = (cos¢,sin@), 
T(e2) = (-sin ¢, cos ¢) 


by directly rotate these two vectors. Hence we have [T¢]a = A. 
(b) Denote that 
Gas cos@ —sing 
®~\sing cosd ]’ 


Directly compute that AgAy = Ag+y. So we have 


[TsTwlo = [TelalTyla = AgAy = Age = [Toru ]a- 


(c) By the previous argument we kow that 
To Ty = Tory = Ty+g = Ty Ty. 


6. If U and T are two rotations, we have det(UT) = det(U) det(T) = 1. Hence 
by Theorem 6.47 UT contains no reflection. If V could be decomposed by 
three one-dimensional subspaces, they all of them are identities, thus UT 
is an identity mapping. Otherwise V must be decomposed into one one- 
dimensional and one two-dimensional subspaces. Thus UT is a rotation on 
the two-dimensional subspace and is an idenetiy on the one-dimensional 
space. Hence UT must be a rotation. 


7. (a) We prove that if T is an orthogonal operator with det(T) = 1 on 
a three-dimensional space V, then T is a rotation. First, we know 
that the decomposition of T contains no reflections by Theorem 6.47. 
According to T, V could be decomposed into some subspaces. If V 
is decomposed into three one-dimensional subspace, then T is the 
identity mapping on V since its an identity mapping on each sub- 
space. Otherwise V should be decomposed into one one-dimensional 
and one two-dimensional subspaces. Thus T is a rotation on the 
two-dimensional subspace and is an idenetiy mapping on the one- 
dimensional space. Hence T must be a rotation. 

Finally, we found that det(A) = det(B) = 1. Hence they are rotations. 


(b) It comes from the fact that det( AB) = det(A) det(B) = 1. 
(c) It should be the null space of AB - I, 


span{((1 + cos ¢)(1 - cos~), (1+ cos ¢) sinw, sin dsiny)}. 
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10. 


11. 


12. 


13. 


14. 


. If T is an orthogonal operator, we know that the determinant of T should 


be +1. Now pick an orthonormal basis 6. If det(T’) = 1, we have det([T],) = 
1 and hence [T], is a rotation matrix by Theorem 6.23. By Exercise 6.2.15 

we know that the mapping ¢p-1,, who maps x € V into its coordinates with 

respect to 3, preserve the inner product. Hence T is a rotation when [T]g 

is a rotation. On the other hand, if det(7’) = det([T]) = -1, we know that 

[T],g is a reflection matrix by Theorem 6.23. Again, T is a reflection since 

[T]g is a reflection matrix. 


. If T is a rotation, its decomposition contains no reflection. Hence we 


have det(T) = 1 by Theorem 6.47. If T is a reflection, then we have 
its decomposition could contain exactly one reflection. Hence we have 
det(T) = -1 by Theorem 6.47. So T cannot be both a rotation and a 
reflection. 


If V is a two-dimensional real inner product space, we get the result by the 
Corollary after Theorem 6.45. If V is a three-dimensional real inner prod- 
uct space and U,T are two rotations, we have det(UT) = det(U) det(T) = 
1. By the discussion in Exercise 6.11.7(a), we know that UT should be a 
rotation. 


Let T(2x,y) = (Ro(x), Re(y)) be an orthogonal operator, where a, y € R? 
and Rg is the rotation transformation about the angle 6. It is neither a 
rotation nor a reflection. 


Let 6 be an orthonormal basis. Then we have [T], = —I,, where n is the 
dimsion of V. Since det(-I,,) = (-1)”, we know that T could decomposed 
into rotations if and only if n is even by the Corollary after Theorem 6.47. 


Use the notation in that Lemma. We know that 
W = 65: (Z) = span{ $3! (#1), 5 (#2)}. 
And compute that 
T ($5 (xi) = op (Ari) € $5' (Z) 
for 1=1,2. Hence W is T-invariant. 
(a) It comes from that 
[Zw (2) = IZ) = Tel. 


(b) Suppose y is an element in W+. Since Ty is invertible by the previous 
argument, for each « « W we have x = T(z) for some z ¢ W. This 
means that 


(T(y), 2) = (y, T°T(2)) = (y, 2) = 0. 


(c) It comes from that 


[Zw (x)I = IT@)I = lel. 
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15. 
16. 


17. 


18. 


Let ¢ = 0 in the equality in Theorem 5.24. 
(c) As the definition of T; given in the Corollary, we know that 
TiLj(0) = + T(ai) ++ Taj) +L, T(0), 


where 
L=AX%z+Latert+%m.- 


(d) Again, we have 
T(x) > T(«1) + T (22) epee T(@m) = TT>-Tm(2), 


where 
L=L_~+Late-+%m. 


(e) We know that det(Ty,) = det(T;) since V = W; @ W; and 


So T; is a rotation if and only if Ty, is a rotation. By Theorem 6.47 
we get the result. 


I think here we won’t say an identity is a rotation. Otherwise the identity 
mapping could be decomposed into n identity mapping. Also, we need 
some fact. That is, if W; is a subspace with dimension one in the decom- 
position, then Ty, could not be a rotation since Tw, (x) sould be either x 
or —%. Hence every ration has the dimension of it subspace two. 


(a) By the Corollary after Theorem 6.47 we know that there is at most 
one reflection in the decomposition. To decompose a space with 
dimension n by rotations, there could be only s(n — 1) rotations. 


(b) Similarly, there is at most one reflection. If there’s no reflection, then 
there’re at most $n rotations. If there’s one reflection, there at most 


- 1 
eee Perey ee 
[>= 5(n-2) 
rotations. 


Let 6 = {x,x'} be an orthonormal basis of V. Since ||y|| = 1, we may write 
¢a(y) = (cos ¢,sing) for some angle ¢. Let 


_f{cos@ —-sing 
*~\sind cosd 


and T be the transformation with [T]s = A. We have that T(x) = y and 
T is a rotation. 


On the other hand, by the definition of a rotation, we must have 


T(2) = (cos@)zx + (sin6)a" 
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and 
T(2’) = (-sin@)x + (cos6)z’. 


Thus we must have cos¢ = cos@ and sing = sind. If 0< ¢,0 < 27, we must 
have 6=9@. So the rotation is unique. 
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Chapter 7 


Canonical Forms 


7.1 The Jordan Canonical Form I 


1. (a) Yes. It comes directly from the definition. 


(b) No. Ifa is a generalized eigenvector, we can find the smallest positive 
integer p such that (T - AI)?(x) = 0. Thus y = (T- AI)?! + 0 is 
an eigenvector with respect to the eigenvalue 4. Hence A must be an 
eigenvalue. 


(c) No. To apply the theorems in this section, the characteristic polyno- 


mial should split. For example, the matrix (' Fi over R has no 


eigenvalues. 
(d) Yes. This is a result of Theorem 7.6. 


(e) No. The identity mapping Iz from C? to C? has two cycles for the 
eigenvalue 1. 


(f) No. The basis 6; may not consisting of a union of cycles. For example, 
the transformation T(a,b) = (a+b,b) has only one eigenvalue 1. The 
generalized eigenspace K, = F?. If 


B= {(1, 1), Cy=1) hs 


then the matrix representation would be 


which is not a Jordan form. 
(g) Yes. Let a be the standard basis. Then [L7]q = J is a Jordan form. 
(h) Yes. This is Theorem 7.2. 
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2. Compute the characteristic polynomial to find eigenvalues as what we did 
before. For each A, find a basis for K consisting of a union of disjoint 
cycles by computing bases for the null space of (A-AJ)?” for each p. Write 
down the matrix S whose columns consist of these cycles of generalized 
eigenvectors. Then we will get the Jordan canonical form J = S7'AS. 
When the matrix is diagonalizable, the Jordan canonical form should be 
the diagonal matrix sismilar to A. For more detail, please see the examples 
in the textbook. On the other hand, these results were computed by 
Wolfram Alpha. For example, the final question need the command below. 


JordanDecomposition[{{2,1,0,0},{0,2,1,0},{0,0,3,0},{0,1,-1,3}}] 


sel) 


(a) 


(b) 


1 0 Ol 2 1 0 0 
0 1 01 0 2 0 0 
oe 0 0 01 i= 0 0 3 0 
0 -1 1 O 0 0 0 3 


3. Pick one basis 8 and write down the matrix representation [T],;. Then 
do the same thing in the previous exercises. Again, we denote the Jordan 
canonical form by J and the matrix consisting of Jordan canonical basis 
by S$. The Jordan canonical basis is the set of vector in V corresponding 
those column vectors of S in F”. 


(a) Pick 8 to be the standard basis 
{1,,0°} 


and get 


and 


4148 ete} 


[T]e = | 


(c) Pick 6 to be the standard basis 


(b) Pick 6 to be the basis 


and get 


| 


mar 


a) 


—=) 


oo 


0 1 0 0 0 
0 0 2 0 0 
0 0 0 0 0 


oo 


and 


}-(r o}(0 sp 


1 O\ /O0 1 
0 O}7’\0 O 


( 


and get 


and 


(d) Pick 6 to be the standard basis 


}-(1 o}-(0 Sp 


1 O\ fO 1 
0 O}’\O O 


( 


and get 


and 
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4. We may observe that W = span(7) is (JT - AJ)-invariant by the definition 
of a cycle. Thus for all w ¢ W, we have 


T(w) = (T -AL)(w) + AT(w) = (T - AT) (w) + Aw € W. 


5. If x is an element of in two cycles, which is said to be 7, and y2 without 
lose of generality, we may find the smallest ingeter g such that 


(T - d1)%(z) = 0. 


This means that the initial eigenvectors of y, and y2 are both (T - 
AI)" (x). This is a contradiction. Hence all cycles are disjoint. 


6. (a) Use the fact that T(x) =0 if only if (-T)(«) = -0=0. 
(b) Use the fact that (-T)* = (-1)*T. 
(c) It comes from the fact 
(Aly -T)* =[-(T-Aly)]* 
and the previous argument. 
7. (a) If U*(x) =0, then U**!(x) = U*(U(z)) = 0. 

(b) We know U™1(V) = U™(U(V)) c U™(V). With the assumption 

rank(U™) = rank(U™*'), we know that 
Perv yew eV), 
This means 
u(u™(V)) =U" (V) 

and so U*(V) =U™(V) for all integer k > m. 

(c) The assumption rank(U™) = rank(U™*') implies 

null(U™) = null(U™*") 

by Dimension Theorem. This means N(U™) = N(U™*!) by the 
previous argument. If U*?(x) = 0, then U(a) is an element in 
N(U™*?) = N(U™). Hence we have U™(U(z)) = 0 and thus z is an 
element in N(U™*'). This means that N(U™*?) c N(U™*') and 
so they are actually the same. Doing this inductively, we know that 
N(U™) = N(U®) for all integer k > m. 

(d) By the definition of Ky, we know 

K) = Upp N((T - Ar)?). 

But by the previous argument we know that 


N((T-AD)™) = N((T-AI)*) 


for all integer k > m and the set is increasing as k increases. So 
actually Ky is N((T-AI)™). 
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(e) Since the characteristic polynomial splits, the transformation T is 
diagonalizable if and only if kK) = Ey. By the previous argument, we 
know that 

KAN ODS ES 


(f) If A is an eigenvalue of Ty, then \ is also an eigenvalue of T by 
Theorem 5.21. Since T’ is diagonalizable, we have the condition 
rank(T — AI) = rank((T — dI)”) 
and so 
N(T- AI) = N((T-AI)”) 
by the previous arguments. This implies 
N(Tw - lw) = N((Tw - Alw)”). 
By Dimension Theorem, we get that 
rank(Tw — ATw) = rank((Tw - ATw)?). 
So Ty is diagonalizable. 


8. Theorem 7.4 implies that 
V= ®@) Ky. 


So the representation is uniques. 


9. (a) The subspace W is T-invariant by Exercise 7.1.4. Let {v;}; be the 
cycle with initial vector v;. Then [Ty], is a Jordan block by the 
fact 

Tw (v1) = AV, 


and 
Tw (vi) = Uj-1 + AV; 


for alli >1. And f is a Jordan canonical basis since each cycle forms 
a block. 


If the ii-entry of [T]g is A, then v; is an nonzero element in Ky. Since 
Ky K,, = {0} for distinct eigenvalues \ and ps, we know that £" is 
exactly those v;’s such that the ii-entry of [T], is A. Let m be the 
number of the eigenvalue 2 in the diagonal of [T’]g. Since a Jordan 
form is upper triangular. We know that m is the multiplicity of .. 
By Theorem 7.4(c) we have 


— 
og 
wa 


dim(K)) =m = ||. 
So f’ is a basis for Ky. 
10. (a 


wa 


Those initial vectors of q disjoint cycles forms a independent set con- 
sisting of eigenvectors in Fy. 
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11. 


12. 


13. 


(b) Each block will correspond to one cycle. Use the previous argument 
and get the result. 


By Theorem 7.7 and the assumptiong that the characteristic polynomial of 
Ly splits, the transformation [4 has a Jordan form J and a corresponding 
Jordan canonical basis 3. Let a be the standarad basis for F”. Then we 
have 

J = (Lalo = [TZ ALT13 


and so A is similar to J. 


By Theorem 7.4(b), the space V is the direct sum of each span(;) = K),, 
where (; is a basis for K),. 


Denote Ky, by W;. Let §; be a Jordan canonical basis for Ty,. Apply 
Theorem 5.25 and get the result. 


7.2. The Jordan Canonical Form II 


1. 


2s 


(a) Yes. A diagonal matrix is a Jordan canonical form of itself. And the 
Corollary after Theorem 7.10 tolds us the Jordan canonical form is 
unique. 

(b) Yes. This is a result of Theorem 7.11. 


1 1 0 : 

0 1 and 0 ) have different Jordan 
canonical form. By Theorem 7.11, they cannot be similar. 

(d) Yes. This is a result of Theorem 7.11. 


(e) Yes. They are just two matrix representations of one transformation 
with different bases. 


(c) No. The two matrices 


(f) No. The two matrices ( ) and ( have different Jordan 


canonical form. 


(g) No. The identity mapping J from C? to C? has two different bases 


{(1,0), (0, 1)} 


and 


{(1, Dy, Cl, une 
(h) Yes. This is the Corollary after Theorem 7.10. 


A coloumn in a dot diagram means a cycle. Each cycle has the corre- 
sponding eigenvalue as the diagonal entries and 1 as the upper subdiagonal 
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entries. So we have 


210000 
GO? Oe). O00) 
7-)9 9 2.0 0 0 
sai (0 ae 3 is | eet 
6.0. 02: 2 0 
00000 2 
4100 
0410 
Jz=1o 9 4 of? 
000 4 


and 


where O is the zero matrix with appropriate size. 


3. (a) It’s upper triangular. So the characteristic polynomial is (2-t)°(3- 


t)?) 
(b) Do the inverse of what we did in Ecercise 7.2.2. For A = 2, the dot 
diagram is 
e e 
e e. 
e 
For A = 3, the dot diagram is 
e e. 


(c) The eigenvalue A with it corresponding blocks are diagonal is has the 
property EF) = Ky. 

(d) The integer p; is the length of the longest cycle in Ky,. So pz = 3 and 
p3 =1. 

(e) By Exercise 7.1.9(b), the matrix representations of U2 and U3 are 


ooOoCcCo oO 
oooo Ff 
ooorOo 
oOoCcCOo oO 
Oroc;c.hcoc 
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and 
0 0 
(0 0) 
So we have rank(U2) = 3, rank(U?) = 1 and rank(U3) = rank(U3) = 0. 
By Dimension Theorem, we have null(U2) = 2, null(U?) = 4 and 
null(U3) = null(U?) = 2. 


4. Do the same thing in Exercise 7.1.2. 


(a) 
ee es i OO 
Q=|2 -1 -2],7=/0 2 1). 
i a 6 002 


(b) 
i ae A 6 O46 
Oslo 0 ore. O60 
1 0 O02 
(c) 
tsp 2 10 0 
Q=[-1 -1 -g|,J=[0 2 1 
1. 32.0 Og 2 
(d) 
tO. fi; 4 Oat dar 
A ey “Os Th 00 0 0 
2-1, 2 9 1]7=/0 0 2 of: 
1 0 1 0 000 2 


5. For the following questions, pick some appropriate basis 6 and get the 
marix representation A = [T’]g. If A is a Jordan canonical form, then we’ve 
done. Otherwise do the same thing in the previous exercises. Similarly, 
we set J = Q-'AQ for some invertible matrix Q, where J is the Jordan 
canonical form. And the Jordan canonical basis is the set of vector in V 
corresponding those column vectors of Q in F”. 


(a) Pick the basis 6 to be 
t,t Lot ot 
{e’, te eu e’,e"} 


and get the matrix representation 


ooo Fr 
OOF rF 
on eS 
en an an) 
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(b) Pick the basis 8 to be 
2 33 
Be as ae 
2 12 


and get the matrix representation 


} 


ooco 
oOoCcC oO 
ooOoroO 
orooe 


(c) Pick the basis 8 to be 


ooo nw~w 
oONrF 


(d) Pick the basis 8 to be 


(i 0).(6 o)-( o)(0 


and get the matrix representation 


2 0O 1 O 
0 3 -1 1 
Ae 0 -1 3 =O 
0 0 O 2 
Thus we have 
1 0 O 1 
0 1 -1 -2 
Q=lo 1 0 2 
00 2 0O 
and 
2 1 0 0 
02 1 0 
we 00 2 OF 
000 4 


(e) Pick the basis 8 to be 


(o o)-(0 )-( o)-60 ap 
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and get the matrix representation 


Thus we have 


and 


(f) Pick the basis 8 to be 


{1,a,y,27,y", zy} 


and get the matrix representation 


000 0 0 0 
000 0 0 0 


Thus we have 


ooo an 


HIN 


ooo pS 


Tuoco 


oC OANoO © 


0 
1 


ocooceo 


1 
0 


ooo o 


° 


0 10 0 0 0 
001 0 0 0 
000 0 0 0 
000 0 1 0 
000 0 0 0 
000 0 0 0 


and 
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6. The fact rank(/) = rank(M"‘) for arbitrary square matrix M and the fact 


7. 


8. 


9. 


(AP AD)" = [(A- AD)" = [(A-AD)T 


tell us 


rank((A - AI)") = rank((A‘ - AJ)"). 


By Theorem 7.9 we know that A and At have the same dot diagram and 
the same Jordan canonical form. So A and A? are similar. 


(a) Let 7’ be the set {v;}; = 1”. The desired result comes from the fact 


T(v;) = AU; + Uj41 


for 1<i<m-1 and 
T(vm) =AUm- 


(b) Let @ be the standard basis for F” and f’ be the basis obtained from 


B by reversing the order of the vectors in each cycle in 8. Then we 
have [Ly]g = J and [Ly]g = J‘. So J and J" are similar. 


(c) Since J is the Jordan canonical form of A, the two matrices J and A 


are similar. By the previous argument, J and J* are similar. Finally, 
that A and J are similar implies that A‘ and J® are similar. Hence 
A and A? are similar. 


(a) Let 6 be the set {v;}32,. Then we have the similar fact 


T(cv,) = Acv, 
and 
T(cv;) = Acu; + CUy_1. 


So the matrix representation does not change and the new ordered 
basis is again a Jordan canonical basis for T’. 


(b) Since (T - AI)(y) = 0, the vector T(a + y) = T(x) does not change. 


Hence 7’ is a cycle. And the new basis obtained from 6 by replacing 
+ by 7’ is again a union of disjoint cycles. So it isa Jordan canonical 
basis for T. 


(c) Let x = (-1,-1,-1,-1) and y = (0,1,2,0). Apply the previous argu- 


ment and get a new Jordan canonical basis 


-1\ (0\ /1\ /1 
0] {1} Jo] fo 
{ |: ) 2 :] 0 Pi 0 }. 
-1) \o} \o} \a 


(a) This is because we drawn the dot diagram in the order such that the 


length of cycles are decreasing. 
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(b) 


We know that p; and r; are decreasing as i and 7 become greater. 
So p; is number of rows who contains more than or equal to j dots. 
Hence 

pj =max{i:r; > 7}. 


Similarly, r; is the number of columns who contains more than or 
equal to i dots. Hence 


r,=max{j:p+ >i}. 


It comes from the fact that p; decreases. 


) There is only one way to draw a diagram such that its i-th row 


contains exactly r; dots. Once the diagram has been determined, 
those p;’s are determined. 


By Theorem 7.4(c), the dimension of K) is the multiplicity of X. 
And the multiplicity of A is the sum of the lengths of all the blocks 
corresponding to A since a Jordan canonical form is always upper 
triangular. 


Since Fy, c Ky, these two subspaces are the same if and only if 
the have the same dimension. The previous argument provide the 
desired result since the dimension of Ey is the number of blocks 
corresponding to A. The dimensions of them are the same if and 
only if all the related blocks have size 1 x 1. 


11. It comes from the fact that 


[T?]a = ([T]s)”. 


12. Denote D; to be the diagonal consisting of those 2j-entries such that 
i-j=k. So Do is the original diagonal. If A is upper triangular matrix 
whose entries in Do are all zero, we have the fact that the entries of A? in 
Dy, 0< k <-p, are all zero. So A must be nilpotent. 


13. 


If T*(x) = 0, then T*!(z) = T*(T(x)) = 0. 


Pick 8 to be one arbitrary basis for N(T'). Extend @; to be a basis 
for N(T‘*'). Doing this inductively, we get the described sequence. 


By Exercise 7.1.7(c), we know N(T') + N(T*') for i< p-1. And 
the desired result comes from the fact that 


T (i) ¢ N(T**) = span(6;-1) # span(3;). 
The form of the characteristic polynomial directly comes from the 


previous argument. And the other observation is natural if the char- 
acteristic polynomial has been fixed. 
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14. 


15. 


16. 


Lh. 


18. 


Since the characteristic polynomial of T splits and contains the unique 
zero to be 0, T’ has the Jordan canonical form J whose diagonal entries 
are all zero. By Exercise 7.2.12, the matrix J is nilpotent. By Exercise 
7.2.11, the linear operator T is also nilpotent. 


The matrix 
0 0 O 
A=]0 0 1 
0 -1 O 


has the characteristic polynomial to be -t(t? +1). Zero is the only eigen- 
value of T = L,4. But T and A is not nilpotent since A® = —A. By Exercise 
7.2.13 and Exercise 7.2.14, a linear operator T is not nilpotent if and only 
if the characteristic polynomial of T is not of the form (-1)"t”. 


Since the eigenvalue is zero now, observe that if x is an element in (6 
corresponding to one dot called p, then T(a) would be the element cor- 
responding to the dot just above the dot p. So the set described in the 
exercise is an independent set in R(T’). By counting the dimension of 
R(T") by Theorem 7.9, we know the set is a basis for R(T"). 


(a) Assume that 
L=VUyptug¢ee + Up 
and 
Y=Uy + Ug te + Uk. 


Then S is a linear operator since 
S(at+cy) = A1(u1 + cur) +++ + Ag (UR + CUR) 


= S(xr)+cS(y). 


Observe that if v is an element in AK), then S(v) = Av. This means 
that if we pick a Jordan canonical basis 8 of T for V, then []g is 
diagonal. 


(b) Let 8 be a Jordan canonical basis for T. By the previous argument 
we have [T']g = J and [S]g = D, where J is the Jordan canonical 
form of T and D is the diagonal matrix given by S. Also, by the 
definition of S, we know that [U], = J-D is an upper triangular 
matrix with each diagonal entry equal to zero. By Exercise 7.2.11 
and Exercise 7.2.12 the operator U is nilpotent. And the fact U and 
S commutes is due to the fact J--D and D commutes. The later fact 
comes from some direct computation. 


Actually, this exercise could be a lemma for Exercise 7.2.17. 


(a) It is nilpotent by Exercise 7.2.12 since M is a upper triangular matrix 
with each diagonal entry equal to zero. 


(b) It comes from some direct computation. 
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19. 


20. 


21 


(c) Since MD = DM, we have 


J’ =(M+D)"=> (7)utpe. 
kao \K 


The second equality is due to the fact that M* = O for all k > p. 
(a) It comes from some direct computation. Multiplying N at right 


means moving all the columns to their right columns. 


(b) Use Exercise 7.2.18(c). Now the matrix M is the matrix N in Exercise 
7.2.19(a). 


(c) If |A| < 1, then the limit tends to a zero matrix. If A= 1 amd m=1, 
then the limit tends to the identity matrix of dimension 1. Con- 
versely, if |A| > 1 but A # 1, then the diagonal entries will not converge. 
If X=1 but m>1, the 12-entry will diverge. 


(d) Observe the fact that if J is a Jordan form consisting of several 
Jordan blocks Jj, then J” = @;J7. So the limm—.oJ™ exsist if and 
only if lim». J!” exists for all i. On the other hand, if A is a 
square matrix with complex entries, then it has the Jordan canonical 
form J = Q-!.AQ for some Q. This means that lim,,_... A” exists if 
and only if lim,,.. J” exists. So Theorem 5.13 now comes from the 
result in Exercise 7.2.19(c). 


(a) The norm |All > 0 since |A;,,;| > 0 for all i and 7. The norm ||A|| = 0 if 
and only if |A;;| = 0 for all ¢ and j. So | All = 0 if and only if A=O. 


(b) Compute 
|cAl] = max{|cAjj|} = |e] max{|Az,l} = [el Al]. 
(c) It comes from the fact 
|Aij + Bis| < |Aigl + [Bay 
for all ¢ and j. 


(d) Compute 


| AB| = max{|(AB),;|} = max{| }> Aj, Be;l} 
k=l 


<max{| )7 | Al) Bl]|} = n| All| BI. 


(a) The Corollary after 5.15 implies that A” is again a transition matrix. 
So all its entris are no greater than 1. 
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(b) Use the inequaliy in Exercise 7.2.20(d) and the previous argument. 


We compute 
JI" |= |PCA™P| <n? |PUA' P| sr? |P PL 


Pick the fixed value c = n?||P71||||P|| and get the result. 


By the previous argument, we know the norm |.J”|| is bounded. If 
J, is a block corresponding to the eigenvalue 1 and the size of J; 
is greater than 1, then the 12-entry of Jj” is unbounded. This is a 
contradiction. 


By the Corollary 3 after Theorem 5.16, the absolute value of eigen- 
values of A is no greater than 1. So by Theorem 5.13, the limit 
limy,+. A” exists if and only if 1 is the only eigenvalue of A. 


Theorem 5.19 confirm that dim(£,) = 1. And Exercise 7.2.21(c) 
implies that Ky = Ey. So the multiplicity of the eigenvalue 1 is equal 
to dim(K,) = dim(£,) = 1 by Theorem Theorem 7.4(c). 


22. Since A is a matrix over complex numbers, A has the Jordan canonical 
form J = Q-!AQ for some invertible matrix Q. So e4 exists if e7 exist. 
Observe that 


23. 


Jur isn™™ I 


by Exercise 7.20(d). This means the sequence in the definition of e7 


J 


converge absolutely. Hence e” exists. 


(a) For brevity, we just write \ instead of \;. Also denote uz, to be 


(A-AI)¥u. So we have (A- AD)uz = ups. Let 


p-1 
pee" SO Oueaeel 
k=0 


be the function vector given in this question. Observe that 
p-1 
(A= AD = e¥ LYS f (8)up-x]. 
k=0 
Then compute that 
p-l 
af = de eM ES FO (Cup 1a 
k=0 


p-1 


=a te [D> f (thup-e] 


k=l 
=\n+(A-Al)a = Az. 
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(b) Since A is a matrix over C, it has the Jordan canonical form J = 
Q-'AQ for some invertible matrix Q. Now the system become 


a’ =QJIQ ‘a 
and so 
Qa’ = JQ a. 
Let y= Q-ta and so x = Qy. Rewrite the system to be 


yf Jy: 


Since the solution of y is the linear combination of the solutions of 
each Jordan block, we may just assume that J consists only one block. 
Thus we may solve the system one by one from the last coordinate 
of y to the first coordinate and get 


f(t) 
M fOW 


fo 


On the other hand, the last column of Q is the end vector u of the 
cycle. Use the notation in the previous question, we know Q has the 
form 

li a 


Q=|Up-1 Up-2 -* Uo 
Thus the solution must be x = Qy. And the solution coincide the 
solution given in the previous question. So the general solution is the 
sum of the solutions given by each end vector wu in different cycles. 


ye 


24. As the previous exercise, we write Q-'AQ = J, where J is the Jordan 
canonical form of A. Then solve Y’ = JY. Finally the answer should be 
X =QY. 


(a) The coefficient matrix is 


Compute 


and 


Thus we know that 


at+b 0 
Y =e a +e 10 
0 Cc 
and so the solution is 
X =QY. 
(b) The coefficient matrix is 

2 1 0 

A=]0 2 1 

0 0 2 


So J=A and Q =I. Thus we know that 


at? +bt+ce 
Y=e74] 2at+b 
2a 
and so the solution is 
X = QY. 


7.3. The Minimal Polynomial 


1. (a) No. If p(t) is the polynomial of largest degree such that p(T) = To, 
then q(t) = t(p(t) is a polynomial of larger degree with the same 
property q(T) = Tp. 

(b) Yes. This is Theorem 7.12. 


(c) No. The minimal polynomial divides the characteristic polynomial 
by Theorem 7.12. For example, the identity transformation J from 
R? to R? has its characteristicpolynomial (1 -t)? and its minimal 
polynomial ¢ - 1. 


(d) No. The identity transformation J from R? to R? has its characteristicpolynomial 
(1-t)? but its minimal polynomial t- 1. 


(e) Yes. Since f splits, it consists of those factors (t - A)" for some 
r <n and for some eigenvalues A. By Theorem 7.13, the minimal 
polynomial p also contains these factors. So f divides p”. 


(f) No. For example, the identity transformation J from R? to R? has its 
characteristicpolynomial (1 - +)? and its minimal polynomial t - 1. 


(g) No. For the matrix F i) its minimal polynomial is (t- 1)? but it 


is not diagonalizable. 
(h) Yes. This is Theorem 7.15. 
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(i) Yes. By Theorem 7.14, the minimal polynomial contains at least 
n zeroes. Hence the degree of the minimal polynomial of T must 
be greater than or equal to n. Also, by Cayley-Hamilton Theorem, 
the degree is no greater than n. Hence the degree of the minimal 
polynomial of T must be n. 


2. Let A be the given matrix. Find the eigenvalues \; of A. Then the 
minimal polynomial should be of the form J];(¢- ;)}. Try all possible 
r,’s. Another way is to compute the Jordan canonical form. 


(a) The eigenvalues are 1 and 3. So the minimal polynomial must be 
(t-1)(t-3). 

(b) The eigenvalues are 1 and 1. So the minimal polynomial could be 
(t-1) or (t-1)?. Since A- I # O, the minimal polynomial must be 
(t-1)?. 


(c) The Jordan canonical form is 


2 0 0 
O 1 1. 
0 0 1 


So the minimal polynomial is (t - 2)(¢- 1)?. 


(d) The Jordan canonical form is 


2 1 0 
O 2 Of. 
0 0 2 


So the minimal polynomial is (t - 2). 


3. Write down the matrix and do the same as that in the previous exercise. 


(a) The minimal polynomial is (t — /2)(t + V2). 
(b) The minimal polynomial is (t - 2)?. 

(c) The minimal polynomial is (¢ - 2)?. 

(d) The minimal polynomial is (¢ + 1)(¢- 1). 


4. Use Theorem 7.16. So those matrices in Exercises 7.3.2(a), 7.3.3(a), and 
7.3.3(d) are diagonalizable. 


5. Let f(t) = t? - 2¢+¢t = t(t-1)?. Thus we have f(T) = Ty. So the 
minimal polynomial p(t) must divide the polynomial f(t). Since T is 
diagonalizable, p(t) could only be t, (t- 1), or t(t-1). If p(t) = t, then 


T =Tp. If p(t) = (t- 1), then T =I. If p(t) = ¢(t- 1), then [T]p = i; 


for some basis (. 
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6. Those results comes from the fact [f(T)]g = O if and only if f(T) = Tp. 


7. By Theorem 7.12, p(t) must of that form for some m; < n;. Also, by 
Theorem 7.14, we must have m; > 1. 


8. (a) Let f(t) be the characteristic polynomial of T. Recall that det(T) = 
f(0) by Theorem 4.4.7. So By Theorem 7.12 and Theorem 7.14 0 is 
a zero for p(t) if and only if 0 is a zero for f(t). Thus T is invertible 
if and only if p(0) #0. 


(b) Directly compute that 


1 
S(T eg gt reel tar 
ao 


222 (GD) = as) ST. 
ao 


9. Use Theorem 7.13. We know that V is a T-cyclic subspace if and only 
if the minimal polynomial p(t) = (-1)" f(t), where n is the dimension of 
V and f is the characteristic polynomial of T. Assume the characteristic 
polynomial f(t) is 


(t-A1)"* (E- Ag)"?(E- An)”, 


where n, is the dimension of the eigenspace of A; since T is diagonalizable. 
Then the minimal polynomial must be 


(t— Az) (t- A2)--(E- Ag). 
So V is a T-cyclic subspace if and only if n; = 1 for all 2. 


10. Let p(t) be the minimal polynomial of T. Thus we have 


p(Tw )(w) = p(T) (w) = 0 


for all w ¢ W. This means that p(Ty) is a zero mapping. Hence the 
minimal polynomial of Ty divides p(t). 


11. (a) If ye V is a solution to the equation g(D)(y) = 0, then g(D)(y’) = 
(g(D)(y))' = 0€V. 
(b) We already know that 
g(D)(y) = 9 


for all ye V. So the minimal polynomial p(t) must divide g(t). If 
the degree of p(t) is less than but not equal to the degree of g(t), 
then the solution space of the equation p(D)(y) = 0 must contain V. 
This will make the dimension of the solution space of p(D)(y) = 0 
greater than the degree of p(t). This is a contradiction to Theorem 
2.32. Hence we must have p(t) = g(t). 
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12. 


13. 


14. 


15. 


16. 


(c) By Theorem 2.32 the dimension of V is n, the degree of g(t). So by 
Theorem 7.12, the characteristic polynomial must be (-1)"g(t). 


Suppose, by contradiction, there is a polynomial g(t) of degree n such 
g(D) = To. Then we know that g(D)(«”) is a constant but not zero. This 
is a contradiction to the fact g(D)(«”) = To(x”) = 0. So D has no minimal 
polynomial. 


Let p(t) be the polynomial given in the question. And let 8 be a Jordan 
basis for T. We have (T - \;)?i(v) = 0 if vu is a generalized eigenvector 
with respect to the eigenvalue \;. So p(T)(38) = {0}. Hence the minimal 
polynomial g(t) of T must divide p(t) and must be of the form 


(t= y= 9) (b- Ag) 


where 1 <r; < p;. If r; < p; for some i, pick the end vector u of the cycle 
of length p; in 8 corresponding to the eigenvalue ;. This u exist by the 
definition of p;. Thus (T-2;)" (wu) = w #0. Since Ky, is (T-A,)-invariant 
and T’— A; is injective on K), for all 7 #7 by Theorem 7.1, we know that 
q(T)(u) # 0. Hence r; must be p;. And so p(t) must be the minimal 
polynomial of T’. 


The answer is no. Let T be the identity mapping on R?. And let W, be 
the z-axis and W2 be the y-axis. The minimal polynomial of Ty, and Tw, 
are both t-1. But the minimal polynomial of T is (t-1) but not (t-1)?. 


(a) Let W be the T-cyclic subspace generated by x. And let p(t) be the 
minimal polynomial of Ty. We know that p(Tw) = To. If q(t) is 
a polynomial such that q(T)(x) = 0, we know that q(T)(T*(2x)) = 
T*(q(T)(2)) = 0 for all k. So p(t) must divide q(t) by Theorem 7.12. 
Hence p(t) is the unique T-annihilator of «. 


(b) The T-annihilator of x is the minimal polynomial of Ty by the pre- 
vious argument. Hence it divides the characteristic polynomial of T, 
who divides any polynomial for which g(T) = To, by Theorem 7.12. 


(c) This comes from the proof in Exercise 7.3.15(c). 


(d) By the result in the previous question, the dimension of the T-cyclic 
subspace generated by x is equal to the degree of the T-annihilator 
of x. If the dimension of the T-cyclic subspace generated by x has 
dimension 1, then T(x) must be a multiple of x. Hence x is an 
eigenvector. Conversely, if 2 is an eigenvector, then T(a) = Ax for 
some A. This means the dimension of the T-cyclic subspace generated 
by x is 1. 


(a) Let f(t) be the characteristic polynomial of T. Then we have f(T) (x) 
To(x) =0€ Wy. So there must be some monic polynomial p(t) of least 
positive degree for which p(T)(x) € Wy. If h(t) is a polynomial for 
which h(T)(x) € W1, we have 


h(t) = p(t)a(t) + r(t) 
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for some polynomial q(t) anr r(t) such that the degree of r(t) is less 
than the degree of p(t) by Division Algorithm. This means that 


(T)(@) = K(T)(a) - p(T) p(T) (x) «Wi 


since W, is T-invariant. Hence the degree of r(t) must be 0. So p(t) 
divides h(t). Thus g(t) = p(t) is the unique monice polynomial of 
least positive degree such that gi(T)(«) « W1. 

(b) This has been proven in the previous argument. 

(c) Let p(t) and f(t) be the minimal and characteristic polynomials of 
T. Then we have p(T)(x) = f(T)(x) = 0 € Wy. By the previous 
question, we get the desired conclusion. 

(d) Observe that go(T')(a) € Wo c Wi. So gi(t) divides go(t) by the 
previous arguments. 


7.4 The Rational Canonical Form 


1. (a) Yes. See Theorem 7.17. 


(b) No. Let T(a,b) = (a,2b) be a transformation. Then the basis § = 
{(1, 1), (1, 2)} is a T-cyclic basis generated by (1,1). But it is not a 
rational canonical basis. 

(c) No. See Theorem 7.22. 


(d) Yes. If A is a square matrix with its rational canonical form C with 
rational canonical basis 8, then we have C = [La], . 


(e) Yes. See Theorem 7.23(a). 
(f) No. They are in general diffrent. For example, the dot diagram of 


‘ 1 ; 
the matrix has only one dot. It could not forms a basis. 


1 

0 1 

(g) Yes. The matrix is similar to its Jordan canonical form and its ra- 
tional canonical form. Hence the two forms should be similar. 


2. Find the factorization of the characteristic polynomial. Find the basis of 
K 4 for each some monic irreducibla polyomial factor consisting of T-cyclic 
bases through the proof of Theorem 7.22. Write down the basis with some 
appropriate order as the columns of Q. Then compute C = Q7'.AQ or find 
C by the dot diagram. And I want to emphasize that I compute these 
answers in Exercises 7.4.2 and 7.4.3 by HAND! 


(a) It is a Jordan canonical form. So 


001 
Q=|0 1 6 
1, 4s °9 
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and 


(b) It has been already the rational canonical form since the character- 
istic polynomial t? + t+ 1 is irreducible in R. So C = A and Q =I. 


(c) It is diagonalizable in C. So 


1 i} 
Q = V3it1 1-V3i 


2 


and 


th 27-4 
Od ea 3 
en (Oe ee 
OO Se 328 
and 
000 -1 
100 0 
Calo 4. <8 
001 0 


(e) Use (0,-1,0,1) and (3,1,1,0) as generating vectors. So 


0 43 3-38 
at “20s 
Q=|9 -3 1 5 
hy. ea AF 
and 
0-2 0 0 
1 0 0 0 
C=lo 0 0 3)" 
0 0 1 +0 


3. Write down the matrix representation A by some basis 6 and find the 
rational C = Q-'AQ for some inveritble Q. Then the rational canonical 
basis is the basis corresponding the columns of Q. 

(a) Let 
B=({1,a, 27, 2°}. 
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Then 


and 


= $5. Then 


(b) Let 8 


and 


(c) Let 


Then 


and 


(d) Let 


Then 
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and 


O° =f 0; 6 
1 10 0 
Ol ok oo. 
001 1 
(e) Let 8=S. Then 
Oe ase J 
1. Oh: 2. 
a (5 
Or. Oa ao 
and 
0-4 0 0 
1: 2G OG) 
C=lo 0 0 of 
0 0 00 


4. (a) We may write an element in R(¢(T)) as ¢(T)(x) for some x. Since 
EEN) = 0 for all v, we have ($(T))'"""(9(T)(2)) = ((Z))""(x) = 


1 1 0 
0 1 0 
0 0 1 


has minimal polynomial (t-1)?. Compute R(L4-I) = span{(1,0,0)}. 
But (0,0,1) is an element in N(L,4-J) but not in R(Ly4 - I). 

(c) We know that the minimal polynomial p(t) of the restriction of T 
divides (d(t))™ by Exercise 7.3.10. Pick an element x such that 
(¢(T))™'(x) #0. Then we know that y = (T)(z) is an element in 
R(¢(T)) and (¢(T))”-?(y) #0. Hence p(t) must be (¢(t))”™*. 


(b) The matrix 


5. If the rational canonical form of T is a diagonal matrix, then T is diagonal- 
izable naturally. Conversely, if T is diagonalizable, then the characteristic 
polynomial of 7’ splits and Ey = Ky,, where @) = t— A, for each eigenvalue 
A. This means each cyclic basis in Kg, is of size 1. That is, a rational 
canonical basis consisting of eigenvectors. So the rational canonical form 
of T is a diagonal matrix. 


6. Here we denote the degree of ¢; and ¢2 by a and 0 respectly. 


(a) By Theorem 7.23(b) we know the dimension of Ky, and Kg, are a and 
b respectly. Pick a nonzero element v; in Ky,. The T-annihilator of 
v1 divides ¢,. Hence the T-annihilator of v; is ¢,. Find the nonzero 
vector vg in Kg, similarly such that the T-annihilator of v2 is ¢2. 
Thus 6,, U 8, is a basis of V by Theorem 7.19 and the fact that 
|Bu, UBy,|=at+b=n. 
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10. 


11. 


12. 


(b) Pick vg = v1 + ve, where v; and v2 are the two vectors given in the 
previous question. Since $;(T)(v2) #0 and ¢2(T)(v1) # 0 by Theo- 
rem 7.18. The T-annihilator of v3 cannot be ¢; and ¢2. So the final 
possibility of the T-annihilator is ¢,¢9. 


(c) The first one has two blocks but the second one has only one block. 


. By the definition of m;, we know the 


N(Gi(T)™ # N(O(T)™ = N(b(T)"™* = Kg, 


Apply Theorem 7.24 and get the result. 


. If d(T) is not injective, we can find a nonzero element x such that 6(T)(x) = 


0. Hence the T-annihilator p(t) divides ¢(t) by Exercise 7.3.15(b). This 
means p(t) = ¢(t). If f(t) is the characteristic polynomial of T, then 
we have f(T)(x) = 0 by Cayley-Hamilton Theorem. Again by Exercise 
7.3.15(b) we have $(t) divides f(t). 


. Since the disjoint union of (;’s is a basis, each 6; is independent and forms 


a basis of span(8;). Now denote 
W; = span(7;) = span({;). 


Thus V = @;W; by Theorem 5.10. And the set y = U;y; is a basis by 
Exercise 1.6.33. 


Since x € Cy, we may assume x = T’(y) for some integer m. If ¢(t) is 
the T-annihilator of x and p(t) is the T-annihilator of y, then we know 
PT)(x) = p(T)(T™(y)) = T™(o(T)(y)) = 0. Hence p(t) is a factor of 
o(t). If « = 0, then we have p(t) = 1 and y=0. The statement is true for 
this case. So we assume that x + 0. Thus we know that y + 0 otherwise 
x is zero. Hence we know p(t) = $(t). By Exercise 7.3.15, the dimension 
of C, is equal to the dimension of Cy. Since x = T™(y) we know that 
Cy, ¢ Cy. Finally we know that they are the same since they have the 
same dimension. 


(a) Since the rational canonical basis exists, we get the result by Theorem 
5.10. 


(b) This comes from Theorem 5.25. 
Let 6 =U;Cy,. The statement holds by Theorem 5.10. 
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GNU Free Documentation License 


Version 1.3, 3 November 2008 
Copyright © 2000, 2001, 2002, 2007, 2008 Free Software Foundation, Inc. 


<http://fsf.org/> 


Everyone is permitted to copy and distribute verbatim copies of this license 
document, but changing it is not allowed. 


Preamble 


The purpose of this License is to make a manual, textbook, or other func- 
tional and useful document “free” in the sense of freedom: to assure everyone 
the effective freedom to copy and redistribute it, with or without modifying it, 
either commercially or noncommercially. Secondarily, this License preserves for 
the author and publisher a way to get credit for their work, while not being 
considered responsible for modifications made by others. 

This License is a kind of “copyleft” , which means that derivative works of the 
document must themselves be free in the same sense. It complements the GNU 
General Public License, which is a copyleft license designed for free software. 

We have designed this License in order to use it for manuals for free software, 
because free software needs free documentation: a free program should come 
with manuals providing the same freedoms that the software does. But this 
License is not limited to software manuals; it can be used for any textual work, 
regardless of subject matter or whether it is published as a printed book. We 
recommend this License principally for works whose purpose is instruction or 
reference. 


1. APPLICABILITY AND DEFINITIONS 


This License applies to any manual or other work, in any medium, that 
contains a notice placed by the copyright holder saying it can be distributed 
under the terms of this License. Such a notice grants a world-wide, royalty-free 
license, unlimited in duration, to use that work under the conditions stated 
herein. The “Document”, below, refers to any such manual or work. Any 
member of the public is a licensee, and is addressed as “you”. You accept the 
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license if you copy, modify or distribute the work in a way requiring permission 
under copyright law. 

A “Modified Version” of the Document means any work containing the 
Document or a portion of it, either copied verbatim, or with modifications 
and/or translated into another language. 

A “Secondary Section” is a named appendix or a front-matter section 
of the Document that deals exclusively with the relationship of the publishers 
or authors of the Document to the Document’s overall subject (or to related 
matters) and contains nothing that could fall directly within that overall subject. 
(Thus, if the Document is in part a textbook of mathematics, a Secondary 
Section may not explain any mathematics.) The relationship could be a matter 
of historical connection with the subject or with related matters, or of legal, 
commercial, philosophical, ethical or political position regarding them. 

The “Invariant Sections” are certain Secondary Sections whose titles are 
designated, as being those of Invariant Sections, in the notice that says that 
the Document is released under this License. If a section does not fit the above 
definition of Secondary then it is not allowed to be designated as Invariant. 
The Document may contain zero Invariant Sections. If the Document does not 
identify any Invariant Sections then there are none. 

The “Cover Texts” are certain short passages of text that are listed, as 
Front-Cover Texts or Back-Cover Texts, in the notice that says that the Doc- 
ument is released under this License. A Front-Cover Text may be at most 5 
words, and a Back-Cover Text may be at most 25 words. 

A “Transparent” copy of the Document means a machine-readable copy, 
represented in a format whose specification is available to the general public, 
that is suitable for revising the document straightforwardly with generic text 
editors or (for images composed of pixels) generic paint programs or (for draw- 
ings) some widely available drawing editor, and that is suitable for input to 
text formatters or for automatic translation to a variety of formats suitable for 
input to text formatters. A copy made in an otherwise Transparent file format 
whose markup, or absence of markup, has been arranged to thwart or discour- 
age subsequent modification by readers is not Transparent. An image format is 
not Transparent if used for any substantial amount of text. A copy that is not 
“Transparent” is called “Opaque”. 

Examples of suitable formats for Transparent copies include plain ASCII 
without markup, Texinfo input format, LaTeX input format, SGML or XML us- 
ing a publicly available DTD, and standard-conforming simple HTML, PostScript 
or PDF designed for human modification. Examples of transparent image for- 
mats include PNG, XCF and JPG. Opaque formats include proprietary formats 
that can be read and edited only by proprietary word processors, SGML or 
XML for which the DTD and/or processing tools are not generally available, 
and the machine-generated HTML, PostScript or PDF produced by some word 
processors for output purposes only. 

The “Title Page” means, for a printed book, the title page itself, plus such 
following pages as are needed to hold, legibly, the material this License requires 
to appear in the title page. For works in formats which do not have any title 
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page as such, “Title Page” means the text near the most prominent appearance 
of the work’s title, preceding the beginning of the body of the text. 

The “publisher” means any person or entity that distributes copies of the 
Document to the public. 

A section “Entitled XYZ” means a named subunit of the Document whose 
title either is precisely XYZ or contains XYZ in parentheses following text 
that translates XYZ in another language. (Here XYZ stands for a specific sec- 
tion name mentioned below, such as “Acknowledgements”, “Dedications”, 
“Endorsements”, or “History”.) To “Preserve the Title” of such a sec- 
tion when you modify the Document means that it remains a section “Entitled 
XYZ” according to this definition. 

The Document may include Warranty Disclaimers next to the notice which 
states that this License applies to the Document. These Warranty Disclaimers 
are considered to be included by reference in this License, but only as regards 
disclaiming warranties: any other implication that these Warranty Disclaimers 
may have is void and has no effect on the meaning of this License. 


2. VERBATIM COPYING 


You may copy and distribute the Document in any medium, either commer- 
cially or noncommercially, provided that this License, the copyright notices, and 
the license notice saying this License applies to the Document are reproduced 
in all copies, and that you add no other conditions whatsoever to those of this 
License. You may not use technical measures to obstruct or control the reading 
or further copying of the copies you make or distribute. However, you may 
accept compensation in exchange for copies. If you distribute a large enough 
number of copies you must also follow the conditions in section 3. 

You may also lend copies, under the same conditions stated above, and you 
may publicly display copies. 


3. COPYING IN QUANTITY 


If you publish printed copies (or copies in media that commonly have printed 
covers) of the Document, numbering more than 100, and the Document’s license 
notice requires Cover Texts, you must enclose the copies in covers that carry, 
clearly and legibly, all these Cover Texts: Front-Cover Texts on the front cover, 
and Back-Cover Texts on the back cover. Both covers must also clearly and 
legibly identify you as the publisher of these copies. The front cover must 
present the full title with all words of the title equally prominent and visible. 
You may add other material on the covers in addition. Copying with changes 
limited to the covers, as long as they preserve the title of the Document and 
satisfy these conditions, can be treated as verbatim copying in other respects. 

If the required texts for either cover are too voluminous to fit legibly, you 
should put the first ones listed (as many as fit reasonably) on the actual cover, 
and continue the rest onto adjacent pages. 
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If you publish or distribute Opaque copies of the Document numbering more 
than 100, you must either include a machine-readable Transparent copy along 
with each Opaque copy, or state in or with each Opaque copy a computer- 
network location from which the general network-using public has access to 
download using public-standard network protocols a complete Transparent copy 
of the Document, free of added material. If you use the latter option, you must 
take reasonably prudent steps, when you begin distribution of Opaque copies 
in quantity, to ensure that this Transparent copy will remain thus accessible at 
the stated location until at least one year after the last time you distribute an 
Opaque copy (directly or through your agents or retailers) of that edition to the 
public. 

It is requested, but not required, that you contact the authors of the Doc- 
ument well before redistributing any large number of copies, to give them a 
chance to provide you with an updated version of the Document. 


4. MODIFICATIONS 


You may copy and distribute a Modified Version of the Document under the 
conditions of sections 2 and 3 above, provided that you release the Modified 
Version under precisely this License, with the Modified Version filling the role 
of the Document, thus licensing distribution and modification of the Modified 
Version to whoever possesses a copy of it. In addition, you must do these things 
in the Modified Version: 


A. Use in the Title Page (and on the covers, if any) a title distinct from that 
of the Document, and from those of previous versions (which should, if 
there were any, be listed in the History section of the Document). You 
may use the same title as a previous version if the original publisher of 
that version gives permission. 


B. List on the Title Page, as authors, one or more persons or entities respon- 
sible for authorship of the modifications in the Modified Version, together 
with at least five of the principal authors of the Document (all of its prin- 
cipal authors, if it has fewer than five), unless they release you from this 
requirement. 


C. State on the Title page the name of the publisher of the Modified Version, 
as the publisher. 


D. Preserve all the copyright notices of the Document. 


E. Add an appropriate copyright notice for your modifications adjacent to 
the other copyright notices. 


F. Include, immediately after the copyright notices, a license notice giving 
the public permission to use the Modified Version under the terms of this 
License, in the form shown in the Addendum below. 
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G. Preserve in that license notice the full lists of Invariant Sections and re- 
quired Cover Texts given in the Document’s license notice. 


H. Include an unaltered copy of this License. 


I. Preserve the section Entitled “History”, Preserve its Title, and add to it 
an item stating at least the title, year, new authors, and publisher of the 
Modified Version as given on the Title Page. If there is no section Entitled 
“History” in the Document, create one stating the title, year, authors, and 
publisher of the Document as given on its Title Page, then add an item 
describing the Modified Version as stated in the previous sentence. 


J. Preserve the network location, if any, given in the Document for public 
access to a Transparent copy of the Document, and likewise the network 
locations given in the Document for previous versions it was based on. 
These may be placed in the “History” section. You may omit a network 
location for a work that was published at least four years before the Doc- 
ument itself, or if the original publisher of the version it refers to gives 
permission. 


K. For any section Entitled “Acknowledgements” or “Dedications”, Preserve 
the Title of the section, and preserve in the section all the substance and 
tone of each of the contributor acknowledgements and/or dedications given 
therein. 


L. Preserve all the Invariant Sections of the Document, unaltered in their text 
and in their titles. Section numbers or the equivalent are not considered 
part of the section titles. 


M. Delete any section Entitled “Endorsements”. Such a section may not be 
included in the Modified Version. 


N. Do not retitle any existing section to be Entitled “Endorsements” or to 
conflict in title with any Invariant Section. 


O. Preserve any Warranty Disclaimers. 


If the Modified Version includes new front-matter sections or appendices 
that qualify as Secondary Sections and contain no material copied from the 
Document, you may at your option designate some or all of these sections as 
invariant. To do this, add their titles to the list of Invariant Sections in the 
Modified Version’s license notice. These titles must be distinct from any other 
section titles. 

You may add a section Entitled “Endorsements”, provided it contains noth- 
ing but endorsements of your Modified Version by various parties—for example, 
statements of peer review or that the text has been approved by an organization 
as the authoritative definition of a standard. 

You may add a passage of up to five words as a Front-Cover Text, and a 
passage of up to 25 words as a Back-Cover Text, to the end of the list of Cover 
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Texts in the Modified Version. Only one passage of Front-Cover Text and one 
of Back-Cover Text may be added by (or through arrangements made by) any 
one entity. If the Document already includes a cover text for the same cover, 
previously added by you or by arrangement made by the same entity you are 
acting on behalf of, you may not add another; but you may replace the old one, 
on explicit permission from the previous publisher that added the old one. 

The author(s) and publisher(s) of the Document do not by this License give 
permission to use their names for publicity for or to assert or imply endorsement 
of any Modified Version. 


5. COMBINING DOCUMENTS 


You may combine the Document with other documents released under this 
License, under the terms defined in section 4 above for modified versions, pro- 
vided that you include in the combination all of the Invariant Sections of all 
of the original documents, unmodified, and list them all as Invariant Sections 
of your combined work in its license notice, and that you preserve all their 
Warranty Disclaimers. 

The combined work need only contain one copy of this License, and multiple 
identical Invariant Sections may be replaced with a single copy. If there are 
multiple Invariant Sections with the same name but different contents, make 
the title of each such section unique by adding at the end of it, in parentheses, 
the name of the original author or publisher of that section if known, or else a 
unique number. Make the same adjustment to the section titles in the list of 
Invariant Sections in the license notice of the combined work. 

In the combination, you must combine any sections Entitled “History” in 
the various original documents, forming one section Entitled “History” ; likewise 
combine any sections Entitled “Acknowledgements”, and any sections Entitled 
“Dedications”. You must delete all sections Entitled “Endorsements” . 


6. COLLECTIONS OF DOCUMENTS 


You may make a collection consisting of the Document and other documents 
released under this License, and replace the individual copies of this License in 
the various documents with a single copy that is included in the collection, 
provided that you follow the rules of this License for verbatim copying of each 
of the documents in all other respects. 

You may extract a single document from such a collection, and distribute it 
individually under this License, provided you insert a copy of this License into 
the extracted document, and follow this License in all other respects regarding 
verbatim copying of that document. 


7. AGGREGATION WITH INDEPENDENT 
WORKS 
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A compilation of the Document or its derivatives with other separate and 
independent documents or works, in or on a volume of a storage or distribution 
medium, is called an “aggregate” if the copyright resulting from the compilation 
is not used to limit the legal rights of the compilation’s users beyond what the 
individual works permit. When the Document is included in an aggregate, 
this License does not apply to the other works in the aggregate which are not 
themselves derivative works of the Document. 

If the Cover Text requirement of section 3 is applicable to these copies of the 
Document, then if the Document is less than one half of the entire aggregate, the 
Document’s Cover Texts may be placed on covers that bracket the Document 
within the aggregate, or the electronic equivalent of covers if the Document is 
in electronic form. Otherwise they must appear on printed covers that bracket 
the whole aggregate. 


8. TRANSLATION 


Translation is considered a kind of modification, so you may distribute trans- 
lations of the Document under the terms of section 4. Replacing Invariant Sec- 
tions with translations requires special permission from their copyright holders, 
but you may include translations of some or all Invariant Sections in addition to 
the original versions of these Invariant Sections. You may include a translation 
of this License, and all the license notices in the Document, and any Warranty 
Disclaimers, provided that you also include the original English version of this 
License and the original versions of those notices and disclaimers. In case of a 
disagreement between the translation and the original version of this License or 
a notice or disclaimer, the original version will prevail. 

If a section in the Document is Entitled “Acknowledgements”, “Dedica- 
tions”, or “History”, the requirement (section 4) to Preserve its Title (section 1) 
will typically require changing the actual title. 


9. TERMINATION 


You may not copy, modify, sublicense, or distribute the Document except as 
expressly provided under this License. Any attempt otherwise to copy, modify, 
sublicense, or distribute it is void, and will automatically terminate your rights 
under this License. 

However, if you cease all violation of this License, then your license from 
a particular copyright holder is reinstated (a) provisionally, unless and until 
the copyright holder explicitly and finally terminates your license, and (b) per- 
manently, if the copyright holder fails to notify you of the violation by some 
reasonable means prior to 60 days after the cessation. 

Moreover, your license from a particular copyright holder is reinstated per- 
manently if the copyright holder notifies you of the violation by some reasonable 
means, this is the first time you have received notice of violation of this License 
(for any work) from that copyright holder, and you cure the violation prior to 
30 days after your receipt of the notice. 
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Termination of your rights under this section does not terminate the licenses 
of parties who have received copies or rights from you under this License. If 
your rights have been terminated and not permanently reinstated, receipt of a 
copy of some or all of the same material does not give you any rights to use it. 


10. FUTURE REVISIONS OF THIS LICENSE 


The Free Software Foundation may publish new, revised versions of the 
GNU Free Documentation License from time to time. Such new versions will be 
similar in spirit to the present version, but may differ in detail to address new 
problems or concerns. See http://www.gnu.org/copyleft /. 

Each version of the License is given a distinguishing version number. If 
the Document specifies that a particular numbered version of this License “or 
any later version” applies to it, you have the option of following the terms and 
conditions either of that specified version or of any later version that has been 
published (not as a draft) by the Free Software Foundation. If the Document 
does not specify a version number of this License, you may choose any version 
ever published (not as a draft) by the Free Software Foundation. If the Docu- 
ment specifies that a proxy can decide which future versions of this License can 
be used, that proxy’s public statement of acceptance of a version permanently 
authorizes you to choose that version for the Document. 


11. RELICENSING 


“Massive Multiauthor Collaboration Site” (or “MMC Site”) means any World 
Wide Web server that publishes copyrightable works and also provides promi- 
nent facilities for anybody to edit those works. A public wiki that anybody can 
edit is an example of such a server. A “Massive Multiauthor Collaboration” 
(or “MMC”) contained in the site means any set of copyrightable works thus 
published on the MMC site. 

“CC-BY-SA” means the Creative Commons Attribution-Share Alike 3.0 li- 
cense published by Creative Commons Corporation, a not-for-profit corporation 
with a principal place of business in San Francisco, California, as well as future 
copyleft versions of that license published by that same organization. 

“Incorporate” means to publish or republish a Document, in whole or in 
part, as part of another Document. 

An MMC is “eligible for relicensing” if it is licensed under this License, and 
if all works that were first published under this License somewhere other than 
this MMC, and subsequently incorporated in whole or in part into the MMC, 
(1) had no cover texts or invariant sections, and (2) were thus incorporated prior 
to November 1, 2008. 

The operator of an MMC Site may republish an MMC contained in the site 
under CC-BY-SA on the same site at any time before August 1, 2009, provided 
the MMC is eligible for relicensing. 
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ADDENDUM: How to use this License for your 
documents 


To use this License in a document you have written, include a copy of the 
License in the document and put the following copyright and license notices just 
after the title page: 


Copyright ©) YEAR YOUR NAME. Permission is granted to copy, 
distribute and/or modify this document under the terms of the GNU 
Free Documentation License, Version 1.3 or any later version pub- 
lished by the Free Software Foundation; with no Invariant Sections, 
no Front-Cover Texts, and no Back-Cover Texts. A copy of the li- 
cense is included in the section entitled “GNU Free Documentation 
License” . 


If you have Invariant Sections, Front-Cover Texts and Back-Cover Texts, 
replace the “with ... Texts.” line with this: 


with the Invariant Sections being LIST THEIR TITLES, with the 
Front-Cover Texts being LIST, and with the Back-Cover Texts being 
LIST. 


If you have Invariant Sections without Cover Texts, or some other combina- 
tion of the three, merge those two alternatives to suit the situation. 

If your document contains nontrivial examples of program code, we recom- 
mend releasing these examples in parallel under your choice of free software 
license, such as the GNU General Public License, to permit their use in free 
software. 
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Appendices 


Some Definition in Graph Theory 


Definition. A graph is a ordered pair of sets (V, FE) and E is the set of pair 
of elements of V. A directed graph, or a digraph is a ordered pair of sets 
(V, E) and E is the set of ordered pair of elements of V. And we call V be the 
set of vertices and E be the set of edges. A tournament is a digraph such that 
for each pair of vertices we have exactly one edge connecting them. 


Example. Let G(Vi, £1) be a graph with vertex set V; = {1,2,3,4} and edge 
- Ey = {(1, 2), (2,2), (2,3), (3, 1)}- 

And let D(V2, E2) be a digraph with vertex set Vo = {1,2,3,4} and edge set 
Ey = {(1,2), (2; 1), (2,2), (2,3), (3,2), (1,3), (3; 1), (1,4), (4,3)}- 
Finally let T(V3, £3) be a digraph with vertex set V2 = {1,2,3,4} and edge set 
E3 = {(1,2), (1,3), 1,4), (2,3), (2,4); (3; 4) }- 


Thus T is a tournament. And we can draw G, D, T as follow. 


G D T 


Definition. A path of length k& in a graph or digraph is a sequence of distinct 
vertices {V1,V2,---,Ups1} such that (v;, vj41) € E. A walk of length & in a graph 
or digraph is a sequence of vertices {v1,v2,...,Uk+1} such that (v;,vi41) € EB. A 
loop is an edge of the form (v, v). 
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Example. Let G and D be the graph and digraph defined above. Then we 
have 1,2,3 is a path and 1,2,1,3 is a walk and (2,2) is a loop in the graph G. 
And we have 1,2,3 is a path and 1,4,3,1 is a walk and (2,2) is a loop in the 
digraph D. 


Definition. In a graph, degree of a vertex v, denoted by d(v), is the number 
of edge adjacent to v. That is, number of elements in edge set of the form (-, v). 
For convenience, we say that a loop contribute a vertex degree 1. In a digraph, 
out degree of a vertex v, denoted by d*(v), is the number of edges of the form 
(v,-); while in degree of a vertex v, denoted by d-(v), is the number of edges 
of the form (-,v). For convenience, we say that a loop contribute a vertex out 
degree 1 and in degree 1. 


Example. In the graph G above, we have d(1) = d(3) = 2, d(2) = 4,and d(4) = 0. 


In the digraph D above, we have d*(1) = d*(2) = 3, d*(3) = 2, d*(4) = 1 and 
d (1) = 2, d (2) =d (3) =3, d (4) =1 


Definition. For an nxn symmetric matrix A, we can associated a graph G(A) 
with it. The graph has vertex set {1,2,...,n} and edge set {(7,j) : aij # O}. For 
an nxn matrix B, we can associated a digraph G(A) with it. The digraph has 
vertex set {1,2,...,n} and edge set {(7,7) : aj; +0}. And the incidence matrix 
of a graph is the matrix setting a;; = aj; = 1 if (i,j) is an edge and a;; = 0 for 
otherwise. And the incidence matrix of a digraph is the matrix setting aj; = 1 if 
(2,7) is an edge and a;,; = 0 for otherwise. So we have an incidence matrix with 
dominance relation is actually the incidence matrix of a tournament. 


Definition. A cliqud'] is the maximal set such that each vertex connects to 
each others. For digraph, v connect to u means (v,u), (u,v) are elements in the 
edge set. 


Note. By induction and some arguement, we can prove that if A is inci- 
dence of a graph(digraph) then (A*),,; is the number of walk from i to j of 
a graph(digraph). 


1This definition is different from the “clique” in general Graph Theory. In general, a clique 
means a subset of vertex of some graph such that each vertices are adjacent to each others. 
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